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ENERGY DISPERSED SOLUTIONS FOR THE (4 + l)-DIMENSIONAL 
MAXWELL-KLEIN-GORDON EQUATION 

SUNG-JIN OH AND DANIEL TATARU 


Abstract. This article is devoted to the mass-less energy critical Maxwell-Klcin-Gordon 
system in 4 + 1 dimensions. In earlier work of the second author, joint with Krieger and 
Sterbenz, we have proved that this problem has global wcll-posedness and scattering in the 
Coulomb gauge for small initial data. This article is the second of a sequence of three papers 
of the authors, whose goal is to show that the same result holds for data with arbitrarily 
large energy. Our aim here is to show that large data solutions persist for as long as one has 
small energy dispersion; hence failure of global wcll-posedness must be accompanied with a 
non-trivial energy dispersion. 
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1. Introduction 

This article is concerned with the mass-less energy critical Maxwell-Klein-Gordon system 
(MKG) in the 4+1 dimensional Minkowski space M 1+4 equipped with the standard Lorentzian 
metric m = diag(—1,1,1,1,1) in the standard rectilinear coordinates (x°,...,x 4 ). This 
system is generated by adding a scalar field component to the standard Maxwell Lagrangian, 

S M [A a ] := / ^-F a pF afi dxcit] 

J Rl + 4 4 

to obtain 

S[A a , 0 ] := / ~ A F ap F^ + l -D a (j)D^dxdt- 

J Rl+4 4 Z 
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Here 0 : M 1+4 —y C is a scalar function, and A a : M 1+4 —y M is a real-valued connection 
1-form, with curvature 

F a0 \= d a A 6 - dgA a . 

The connection 1-form A a is then used to define the covariant derivative 

D q 0 := (d a + iA a )(j>. 

Introducing the covariant wave operator 

□a := D“D q 


with the standard convention for raising/lowering and summing indices, we can write the 
Maxwell-Klein-Gordon system in the form 


= -j,, 

n A <t> = o ( ' 1 

where the currents J Q are defined as 

J a := -Im(0D a 0) . (1.2) 


The MKG system admits a positive definite formally conserved energy functional, 

£{A,m=£{dMA, <!>]■■= f (lEC + ^El 0 ^)* (1.3) 

• / 4 ... a ~ ... 


a,/3 


and is also invariant under the scaling 

4>(t,x) —> A _1 0(A _1 t, A^x), A a (t,x) —> A _1 H(A _1 t, A^ 1 ^). 


Thus the 4 + 1-MKG system is energy critical. 

In order to state this system as a formally well-posed initial value problem, we need to 
take into account its gauge invariance. If (A a , 0) is a solution, then so is ( A a — d a \i e* x 0) for 
any real-valued scalar function y. In the gauge covariant setting, it is natural to define an 
initial data set for MKG to consist of a pair of 1-forms (aj , ej ) and complex-valued functions 
/, g on M 4 . We say that (a, e, /, g ) is the initial data set for a solution (A, 0) if 

(Ai, Foji 0, D<0) t{t=0}= 

where the latin indices only run over the spatial variables x 4 ,...,a; 4 . The energy of the 
set ( a,e,f,g ), denoted by £[a, e, /, g\, is defined in the obvious way from (II.3j) . The a = 0 
component of the MKG system imposes the Gauss (or constraint) equation for initial data 
sets, namely 

tfe e = - Jo = Im (fg). (1.4) 

To eliminate the gauge ambiguity, we add to the above system a single scalar gauge 
condition. Here we follow the approach in [T3j and work with the global Coulomb gauge 

4 

Y, a ’ A i = 0 ( 1 - 5 ) 

3 =1 

where latin summation indices are used for summations which are only with respect to spatial 
variables. Using this gauge, the MKG system can be written explicitly in the following form 

( G\Ai = ViJ x 

\ CU0 = 0 
2 


( 1 . 6 ) 



for the dynamic variables (Ai,(ft). The operator V is the Leray projection onto divergence 
free vector fields, 

VjV = I — djA~ 1 d £ V£ 

The second equation in (11.61) requires also the temporal component which is determined 
in an elliptic fashion, together with its time derivative, by 

AA 0 = Jo, Ad t A 0 = <9% (1.7) 

Note that the first equation is precisely the Gauss equation. These equations uniquely 

determine both A 0 and d t A 0 at fixed time. 

Wcll-posedness theory of MKG at (scaling) sub-critical regularity have been studied ex¬ 
tensively in various gauges. In dimensions 2 + 1 and 3 + 1, this system is energy sub-critical, 
and hence global wcll-posedness follows from an appropriate local well-posedness result; see 
[H EJ 0J EUE! UHl ESI 12T| and references therein. In M 1+4 , almost optimal local well-posedness 
of a model problem closely related to MKG and the Yang-Mills system was proved in [9]; 

this result was then further refined in [20, 22]. For a more detailed survey of earlier works 

on MKG, see [18] Section 1.3]. 

The subject of this article, as well as its companions HOT, is the energy critical MKG- 
CG problem in 4+1 dimensions. Given an arbitrary finite energy data set for the MKG 
problem, there exists an unique gauge equivalent data set of related size which satisfies the 
Coulomb gauge condition; see PH Section 3]. Hence the main question question now is to 
decide whether each finite energy MKG-CG initial data set can be extended to a global- 
in-time solution for the MKG-CG system. This is analogous to the celebrated threshold 
conjecture for energy critical wave maps, which has been recently answered in the affirmative 
[m EH EU [261 EZl [281 |29l |30] (see also m). 

The small data global wcll-posedness result was first obtained in high dimension n > 6 
by Rodnianski-Tao [IS]. The low dimensional result n > 4 was obtained more recently by 
Krieger-Sterbenz-Tataru [13] . The theorem in [13] asserts the following: 

Theorem 1.1 ([13]). There exists a universal constant e* > 0 such that the following hold. 

(1) (Existence and uniqueness) Let (a,e,f,g) be a C°° Coulomb data set (i.e., d £ ai = Oj 
satisfying 

£[a,e,f,g\ < e* ■ (1-8) 

Then the MKG-CG system (11 . 5[) - (11.61) admits a unique global smooth solution (A, (ft) on 
R 1+4 with these data. 

(2) (Continuous dependence) In addition, for every compact time interval J containing 0, 
the data-to-solution operator extends continuously on the set (11. 8p to a map 

3 (a, e, /, g) -3 (A, (ft) G C(J] H\R 4 )) G C\J; L 2 (M 4 )) 

where the space TL 1 = 'H 1 (M 4 ) of finite energy initial data sets is defined by the norm 

\\(a,ej,g)\\ n i := ||a||^i + ||e || L 2 + ||/||^i + ||^|| L 2 . (1.9) 

The last statement allows us to define the following notion of finite energy solutions: 

Definition 1.2. Let / be a time interval. We define the space CtH l (I x K 4 ) by the norm 

IKA0)l|c t «i(7xM+ =eSSSUp(supP #1 [t]||£ lxLa + ||#]||+i xL 2) . 

ie/ v M ' 
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We say that a pair (A, 0) e CVH 4 (/ x R 4 ) is an admissible CRH . 1 solution to MKG on / if 
there exists a sequence (A^ n \ q of classical solutions to MKG on I x R 4 such that 

II (A,4>) — (A^ n \ 0^)||Ct^qjxK 4 ) as n —> oo 
for every compact subinterval J C I. 

In the process of proving the above result in [13], stronger spaces S 1 , Y 1 C C(H l ) flG 1 (L 2 ) 
are introduced, and it is shown that the above solutions obeys the bound 

IIAjIIv 1 + 11 (^ 0 ) 11*1 < \\(a,e,f,g)\\ n i ( 1 - 10 ) 

with a continuous (but not uniformly continuous) data-to-solution map on each compact 
time interval. We provide the definition!! of the spaces S 1 and Y 1 in Section [3] 

Our goal, in a sequence of three papers, is to prove that a similar result holds for all finite 
energy data. The three steps in our proof are as follows: 

Global Coulomb gauge [IT] : Here we use the above small data result to show that the 
large data problem is locally well-posed in the Coulomb gauge, and that the solution 
can be extended for as long as energy concentration does not occur. 

Energy dispersed solutions (present paper): Here we prove a more refined continua¬ 
tion criterion, namely that the solution can be extended for as long as it remains 
energy dispersed. Moreover, if the solution already exists up to £ = oo, then we 
prove that small energy dispersion implies scattering. 

Blow-up analysis [TSj: Here we complete the proof of the large data wcll-posedness result, 
showing that no blow-up is allowed at the tip of a light cone. We also prove the 
corresponding scattering result. 

At least in a broad outline, the second and third step above follow the scheme successfully 
developed in [23] and [23J in the context of wave maps. The first step in na is specific to 
the MKG problem, and is due to the long range effect of the Gauss equation as well as the 
inherent gauge ambiguity of MKG. Precisely, in order to truncate a large energy initial data 
into small energy data sets, the Gauss equation d £ ee = Im (fg) must be taken into account. 
Furthermore, the local gauges given by the small data result applied to these truncated data 
differ in their common domains, and need to be aggregated into a single global Coulomb 
gauge. An overview of the whole sequence is provided in [IHJ Sections 2 and 3]. 

Remark 1.3. To understand the issue of gauge invariance clearly, it is advantageous to take 
a more geometric point of view and consider 0 as a section of a complex line bundle L with 
structure group U( 1) = {e lx : y e R} over R 1+4 , and A as a connection on L. Since the 
base manifold R 1+4 is contractible, L is always topologically trivial; hence 0 can be identified 
with a C-valued scalar function, and A with a real-valued 1-form on R 4 by using the trivial 
connection d as a reference. The choice of a gauge then corresponds to a particular choice 
of bases on the fibers to describe ( A , 0). This viewpoint is taken to some extent in the other 
papers of the series [mm] to facilitate the usage of local gauges. In the present paper, 
however, we need not worry about such issues, as we work exclusively in the global Coulomb 
gauge. 

4 By classical, we mean that A,<j> G n“ m=0 C' t m (I; H n ). 

2 We remark that the precise definition of S' 1 differs in ns, ini and in the present paper. The difference 
is however minor, and all the theorems stated here hold with respect to any of these three definitions. See 
Remark 13.11 
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Roughly speaking, the main result in m is local well-posedness of MKG-CG for data with 
any finite energy E, with a lower bound on the lifespan in terms of the energy concentration 
scale 

r c = r c [a, e, /, g] := sup{r > 0 : Vx G R 3 4 , S Br ( x )[a,e, f, g] < 5 0 {E,el)} (1.11) 

where 5o(E , el) > 0 is some fixed functiorO , el is the threshold in Theorem II. II and £b t (x) is 
the energy measured on the ball B r (x ) of radius r centered at x. Observe that r c [a, e, f 1 g)> 0 
for any (a,e,f,g) G H 1 . 

The result in m also admits a formulation in terms of the S 1 , Y 1 norms; for that we need 
a generalization of these norms to bounded time intervals, which we denote by <S' 1 [£o, £i], 
Y%,ti] (see Section I3T21 for the dehnition). The precise statement is as follows. 

Theorem 1.4 (Large energy local well-posedness theorem in global Coulomb gauge [ITT]). 
Let (a, e, /, g) be an Li 1 initial data set satisfying the global Coulomb gauge condition cfiai = 0 
with energy E[a, e, /, g] < E. Let r c = r c [a, e, /, g] be defined as in (11. lip . Then the following 
statements hold: 

(1) (Existence and uniqueness) There exists a unique admissible C t TL 1 solution {A, (ft) to 
MKG-CG on [— r c ,r c ] x R 4 with ( a,e,f,g) as its initial data. 

(2) (A-priori S 1 regularity) We have the additional regularity properties 

A 0 G y 4 [—r c , r c ], A x , (ft G S 1 [-r c pr c \. 

(3) (Persistence of regularity) The solution (A, (ft) is classical if (a, e, /, g) is classical 0 

(4) (Continuous dependence) Consider a sequence fO ^ gGfi of Pi 1 Coulomb initial 

data sets such that 

|| (a (n) - a, e (n) - e, f {n) - /, g {n) - g)\\ n i ->• 0 as n ^ oo. 

Then the lifespan of {A^ n \ (ftfiC) eventually contains [— r c ,r c ], and we have 

||^4o — Aq 1 ^ 11y 1 [—r c ,r c ] + II (A x — A^\ (ft — (ft^) 11S’ 1 [—r- c ,r- c ] 0 OS Tl > OO. 

in other words, this result says that even if the initial data is large, we can continue the 
solution as a global Coulomb solution with good S 1 bounds for as long as energy does not 
concentrate to arbitrarily small balls. 

Our main result here is based on the notion of energy dispersion introduced in [23]. 
Adapted to our context, the energy dispersed norm we use is 

WHed^m) = sup2 k \\(P k (ft,2 fc Tfc0f)||L°°[(t 1 ,t 2 )xR 4 ] (1-12) 

k 

We measure the energy dispersion only for (ft , and not for A. The main theorem is as follows: 

Theorem 1.5 (Energy Dispersed Regularity Theorem). There exist two functions 1 -C F(E) 
and 0 < e(E) Cl o/ the energy (jl.3[l such that the following statement is true: 

If (A, (ft) is an admissible CtPL 1 solution to MKG-CG on the open interval (ti,t 2 ) with 
energy < E and energy dispersion at most e(E), i.e., 

||0|| ED(t u t 2 ) < e ( E ), 


3 In HZ] we use d 0 (E, e 2 fi « elE ~ 2 for E > e 2 . 

4 Here, by classical we mean a,e,f,9 £ D ( fL 0 H n . 
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(1.13) 


then the following a-priori bound holds: 

ll(A,«l| S ( tI , b )<F(S). 

We remark that (11.131) implies the bound (see Theorem 15.ip 

II A)||+ II (A, 0)||s 1 (n,t 2 ) ~f(e) 1- 

We also prove a continuation and scattering result, which may be applied in conjunction 
with Theorem 11.51 

Theorem 1.6 (Continuation and scattering of solutions with finite S 1 norm). Let (A, (ft) be 
an admissible CfH 1 solution to MKG-CG on [0,T + ) x M 4 , with 0 < T + < oo, obeying the 
bound 

||A||y 1 [ 0 ,T + ) + II (A, 0)||s 1 [O,T+) < OO. 

Then the following statements hold. 

(1) IfT + < oo, then (A, (ft) extends to an admissible CftH 1 solution with finite S 1 norm past 

T+. 

(2) IfT + = oo, then (A x ,(ft) scatters ast —>■ oo in the following sense: There exists a solution 
(2l£A, <ft(°°l) to the linear system 

= 0, (□ + 2iA f e ree d e )^ oo) = 0, 

with initial data AA [0], [0] e H 1 x L 2 such that 

HAW - Ac^WIAxl 2 + IWM _ 0 (oo) WIIifixL 2 0 as T -)• oo. 

Here H£ ree is a homogeneous wave wit) % Af ree [ 0] = AJ0]- 
Analogous statements hold in the past time direction as well. 

Our strategy for proving Theorem 11.51 is to use an induction on energy argument; this 
is imposed by the requirement to renormalize paradifferential interactions of the solution 
with itself. This is somewhat similar to the proof of the corresponding result for wave maps 
in |23j . See also [TO] for an exposition of this argument in the context of wave maps, and 
Section 2 in the main paper of the sequence fTS] for a brief summary of our strategy. 

Remark 1.7. We remark that the same results hold in all higher dimensions for data in the 
scale invariant space H 2 ~ 4 x H^~ 2 . We have chosen to restrict our exposition to the more 
difficult case d — 4 in order to keep the notations simple, but our analysis easily carries over 
to higher dimension d > 5. The main difference in higher dimension is that we no longer 
have a conserved energy which is equivalent to the critical Sobolev norm. However, the small 
energy dispersion guarantees that the critical energy is almost conserved. 

Remark 1.8. We note that an independent proof of global well-posedness and scattering 
of MKG-CG has been recently announced by Krieger-Luhrman, following a version of the 
Bahouri-Gerard nonlinear profile decomposition [I] and Kenig-Merle concentration compact¬ 
ness/rigidity scheme [6, T] developed by Krieger-Schlag [IT] for the energy critical wave maps. 

5 This choice is somewhat robust, in that one can freely perturb Aft. ree [ 0] by any function in A^H 1 x L 2 ) 
where A stands for dyadic summation in frequency. In particular one can take Af ree = A°°^ ■ 
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1.1. Notation and Conventions. We use the asymptotic notation A < B and A = 0(B) 
to mean A < CB for some C > 0. We write A -C B if the implicit constant should be 
regarded as small. The dependence of the constant is specified by a subscript. 

Our convention regarding indices is as follows. The greek indices a, f3, ... run over 0,..., 4, 
whereas the latin indices i,j, ... only run over the spatial indices 1,..., 4. We raise and lower 
indices using the Minkowski metric, and sum over repeated upper and lower indices. 

We refer to each directional derivative by <9 M , and the full space-time gradient by V. We 
denote the (gauge) covariant derivative by = <9 M + iA p . For (Fourier) multipliers and 
pseudodifferential operators, it is convenient to use D p = \d p , whose symbol is 

Global small constants. We introduce a string of globally defined small constants, which are 
used in our main argument contained in Sections [2HSJ 

0 < <5** « 5* « b 0 « 5i « c « 5 « 1. 

Logically, each constant is chosen to be small enough depending on the one to the immediate 
right. For the convenience of the reader, we summarize the role of each constant as follows: 
6 is the exponent for dyadic gains in bilinear and multilinear estimates, most which come 
from [13]; c enters in the gain in large frequency gaps m; 8\ is used for the gain in small 
energy dispersion; <5o is reserved for the definition of admissible frequency envelopes; <5* and 
<5** are the small constants used in the induction on energy argument in Section [6] 

Littlewood-Paley projections. Let m< 0 (r ) be a smooth cutoff that equals 1 on {r < 1} and 
vanishes on {r > 2}. For k G Z, let m< k (r) := m<o(r/2 k ) and m k (r) := m< k (r ) — m<fe_i(r); 
then supp m k Q { 2 k < r < 2 fc+1 } and forms a locally finite partition of unity, i.e., Y2k mk = 
1. Using the space-time Fourier transform J r , we define various dyadic (or Littlewood-Paley ) 
projections as follows: 

P k <p = ^[mk G^D-FM], Qjtp = J r ^ 1 [mj(\\T\ - |£||).F[^]], Sup = J r ~ 1 (in k (\(T^)\)J r [^}}. 

We also define Qf := Q ± Qj, where Q± := J r ~ 1 [l[o, 0 o)(±T)J r [y3]] restricts to the ± frequency 
half-space. For an interval / C Z, we define Pj = YlkeJ e ^ c - ^ one place, we allow P k 
to depend continuously on k e M; see the definition of (A[O],0[O]) in Section [HI 

Frequency envelopes. For some more accurate bounds at various places we need to keep better 
track of the frequency distribution of norms. This is done using the language of frequency 
envelopes. An admissible frequency envelope will be any sequence {c k } ke % of positive numbers 
which is slowly varying, 

Cjfc k < 2 Solj ~ kl 

with a small universal constant 5q. Given such a sequence and a norm A", we define the norm 

11011 X c = SUp Cjf 1 1| Tfc0||_Y- 
k 

We say that c is a frequency envelope for the data (A^O], 0[O]) if for every k G Z, we have 

||(-PfeAj;[ 0 ],-Pfc 0 [O])||jyi xi 2 < c k . 

Given any A X [O],0[O] G H 1 x L 2 , we may construct such a c by convolving with 2 -<5 °l'l, i.e., 

c k := . 

k> 
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By Young’s inequality, we have ||c ||^2 < ||(A,,[0] > 0[°])llif 1 xL2. 


1.2. Structure of the paper. In Section [2j we begin with some elliptic gauge related hxed 
time estimates. In particular these will help us relate the full nonlinear gauge independent 
energy with the linear energy associated to the MKG-CG system. 

In the following section we switch to space-time analysis, and define the function spaces 
S 1 and IV; with minor changes this follows [13]. We also recall some useful estimates from 
[ 13] ; and add to that some additional properties related to the interval decomposition of the 
S 1 and N spaces. 

In Section H] we describe the decomposition of the nonlinearity, and state the main bilinear 
and multilinear bounds which enter into the proof of our main result. To overcome difficulties 
related to large data, here we consider two additional classes of estimates, namely energy 
dispersed bounds and time divisible estimates. 

In Section Owe consider MKG waves of finite S 1 norm, and we establish further regularity 
properties for such waves. Based on these properties, we establish Theorem 11.61 We also 
consider the special case of MKG waves with small energy dispersion, and show that some 
other norms of such waves must also be small. 

Section 0 contains the proof of our main result in Theorem 11.51 This is achieved using an 
induction of energy argument, following the principles introduced in [23] . 

The following two sections contain the proof of the bilinear and the trilinear estimates, 
where, in addition to results from [13] , we bring in the energy dispersion and divisible norms. 
Heuristically, we will see that the role played by the small energy dispersion is to improve 
all the balanced frequency interactions in the bilinear estimates in Section [7] In the trilinear 
estimates in Section [3] there are possibly large unbalanced frequency interactions for which 
the small energy dispersion does not seem effective. Nevertheless, we show that the bulk can 
be bounded by a time divisible norm. This property allows us to carry out an induction on 
energy scheme as in Section |6] 

Finally, the last section contains our paradifferential parametrix construction, based on 
those in mm- While very different technically, at the conceptual level this is similar to 
the argument in [23] . The main idea there is that a large frequency gap, rather than the 
small energy dispersion, is used to control the large paradifferential term. 

Acknowledgements. Part of the work was carried out during the trimester program ‘Har¬ 
monic Analysis and Partial Differential Equations’ at the Hausdorff Institute for Mathematics 
in Bonn; the authors thank the institute for hospitality. S.-J. Oh is a Miller Research Fellow, 
and acknowledges the Miller Institute for support. D. Tataru was partially supported by 
the NSF grant DMS-1266182 as well as by the Simons Investigator grant from the Simons 
Foundation. 


2. Fixed time elliptic bounds and the energy 

While the energy (II.dp £[A,(f>] of the MKG system is gauge independent, when consider¬ 
ing the system in the Coulomb gauge it is convenient to view (A x ,<f>) as the main dynamic 
variable, while Ao and dtAo are derived quantities obtained via the equations (1 1.71) . Corre¬ 
spondingly, we view 


(A X [O],0[O]) = (A x ,d t A x ,(/),d t (j))( 0) 








as the initial data for the MKG-CG system, and determine the gauge covariant initial data 
set (a,e,f,g) via (II.7J) . We remark that (A X [O],0[O]) can be freely prescribed up to the 
Coulomb condition d l Afi 0) = 0. In this context, it is convenient to work with the linear 
energy 

E Un [A x ,(ft){t) = E Hn (A x [t],(ft[t]) := ]- I ^2 |<V4,-(f)| 2 + dx. ( 2 . 1 ) 

J /i=0,...,4 h=0,...,4 

In order to justify this, we need to show that A 0 is indeed uniquely determined by (A x , (ft) at 
each time, and that the two energies are in some sense comparable. This is the goal of the 
main result here. In the process, we will also obtain some further solvability estimates for 
the equations (11.71) for A 0 that will also come in handy in the context of space-time bounds. 
We have: 


Proposition 2.1. The following statements hold. 

(1) Let (A x ,F Ox ,<ft,~D t (ft)(0) be a finite energy initial data set for the MKG-CG system. Then 
(Ar[0], 0[O]) G H 1 x L 2 and we have the estimate 

EUMOUM) <£[A<ft] +£[A,<ft} 2 , ( 2 . 2 ) 

where £[A,(ft] denotes the energy of the initial data set (A x , F 0x , (ft, D t 0)(O). 

(2) Conversely, suppose that (A^O], 0[O]) G H 1 x L 2 . Then there exist unique solutions 
(A 0 ,<9tA 0 ) G H 1 x L 2 for the equations (11.71) . depending smoothly on (AJ0], 0[O]) in the 
above topologies. Further, £(A,(ft) depends smoothly on (AJO], 0[O]) ; and we have the 
energy relation 

E[A, (ft] < E lin (A x { 0], 0[O]) + E lin (A x [ 0], 0[O]) 2 . (2.3) 

(3) Assume in addition that 0[O] obeys the fixed time energy dispersion bound 

\m\\ED ■■= sup2- k \\(P k (ft,2- k P k d t (ft)m L °° (Mb < e (2-4) 

k 

with e <Cf[A, 0 ] 1. Then we have 

£[A,(ft\ = En n (A x [0], 0[O]) + Og[A,<j>]{^ A )- (2-5) 


Proof. All estimates here are at fixed time, so we dispense with the time variable from the 
notations. We denote the two energies £[A,(ft] and En n (A x [ O],0[O]) simply by E and En n , 
respectively. 


(1)). We begin with the spatial components of the energy, where we have 

\ E HA t - d.AjlH, < E. 

Combined with the gauge condition d J A 3 = 0, this gives the linear elliptic bound 


1 


\\A. 


2 

* 11^1 


1 


E 


fell L 2 ^ 


< E, 


i<i,fc<4 


and Sobolev embeddings further yield 









On the other hand we also have 


ID, 


lh < E. 


By the diamagnetic inequality and Sobolev embeddings we obtain 

Will* ^ IIVMIH 2 < \\D x (j)\\ 2 L 2<E. 

Then we can further estimate 

M% = II < IID^IIi, + ||A^||£ a < E + E- 

Next we turn our attention to the temporal components. We first have 

— \\dtA x — VWL 0 ||i 2 < E. 

Applying the divergence and using the Coulomb gauge condition we obtain 

IIAjII^-i < ||AA 0 ||^_i < E. 

As the energy E also controls ||D t 0||| 2 , arguing as above we also obtain 

mwi, < e+e\ 

which concludes the proof of (12.21) . 


(2)). We begin with the analysis of the first equation in fl 1.71) . which is rewritten as 

6| 2 )A 0 = -Im (0c^0). 


-A 


We hrst need to know that this equation is solvable. More generally, we consider the inho¬ 
mogeneous problem 

(—A+ |0| > = / (2.6) 

The solvability of this equation is dealt with via the following fixed time lemma: 

Lemma 2.2. Consider the equation (12.61) with f E H 1 . Set E 0 = ||0||^i. Then 

(a) If f E H -1 then there exists a unique solution u E H 1 , satisfying 

||d 


I H 1 ~ 


H~ 


Further, the map (0, /) -+ u is smooth in the H 1 x H 1 —>■ H 1 topology. 

1 * 3 

(b) If f E H~ 2 then there exists a unique solution u G H?, satisfying 


W\\ k i ^ E 0 


1 


Further, the map (0, /) —)■ Aq is smooth in the H 1 x H 2 — > Ft? topology, 

(c) In addition, for any frequency envelope c G C we have the bounds 


Ml Hi ~e 0 


H7 i 


\U\ 


■A r^jEo 


Proof, (a) By Sobolev embeddings we have 


liA-sA- 1 ~ IM 


H, 


l < Mil 




(2.7) 


( 2 . 8 ) 


(2.9) 


Hence the operator —A+10| 2 is bounded from H 1 —tH h It is also self-adjoint and coercive, 
so the bound 


Ml A 1 — 


H- 1 
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immediately follows. The regularity of the map (0, /) —>• u is obtained in a similar manner, 
by looking at the linearized equation. 


(b) More generally, we will take / G H a and prove that we can solve for u G H a+ 2 for any 
—2 < cr < 0. This in particular easily implies the frequency envelope bounds in part (c) By 
duality it suffices to consider the case — 1 < a < 0. 

To solve the problem perturbatively in H a+2 it suffices to construct a multiplier A so that 
A(0 |£r +1 and 

IIAI^fA- 1 - « 1 (2-10) 

Then we can rewrite the equation as 

(-A + |0| 2 )Aw = A/ - (A^fA" 1 - |0| 2 )Au, 

and the above estimate allows us to solve the above equation perturbatively based on the 
H 1 solvability in part 


By duality and a Littlewood-Paley decomposition, (12.101) would follow if we had the 
stronger bound 




AtifcjA ^kl 0/c,3 0/^4 


< IMI^i 


( 2 . 11 ) 


We will denote each summand on the left hand side by I(ki, fc 2 , k 3 , fc 4 ). To achieve (12.111) we 
will choose A radial, with the property that A(r) is non-decreasing and 

/ s \ 

A(s) < A(r) , s > r. 

Estimating each dyadic contribution using Sobolev embeddings we have 


^/c 1 ^*^4 


< 2 (k max ^»)|| Mfcl ||^ 1 || Ufc2 ||^ 1 ||0 fc3 ||^ 1 ||0 fc4 | 


H 1 


and similarly 


AWfcjA 


< T {k max fe ™»)||^ i ||^ 1 || Mfe2 ||^ 1 ||0 fe3 ||^ 1 ||0 fc4 | 


m 


where k max = rnax{/q,..., fc 4 }, /c min = min-fAq,..., fc 4 }. Hence contributions from widely 
separated frequencies are small. To measure that, we fix a frequency gap parameter m (which 
will be chosen depending only on E 0 ) and split 

I — Iclose ~t~ I far •— ^ ( I{k\i ■ ■ ■ ■ ^4) T ^ ^ I{^h ■ ■ ■ : fel). 

kmax kmin < ^-'m kmax kmin^.’kkl 

For I f ar we have 

Ifar<2^\\u\\%M\h 

which can be made sufficiently small by choosing m large enough compared to E 0 . For I c i ose 
we use the off-diagonal decay to obtain 

Iclose <m ll^ll^fl ||0||jgU 2 

Hence only the large dyadic parts of 0 have nontrivial contributions. To account for those, 
we choose a finite set of dyadic indices K C Z outside of which we have 

UWB'JiKc) : = SU P 2^||0fe||L 2 1. 
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( 2 . 12 ) 















Note that the number of indices in K can be bounded by a constant depending only on Eq. 
Since I c i OS e only allows interactions of frequencies at most m apart, it is natural to expand 
K by m to K rn {k + k' : k £ K, \k'\ < m}. Then all unfavorable (i.e., large) interactions 
occur only for frequencies within K m , i.e., 

Iclose — E I{k\i ■ ■ ■ , Ik&) T 'y ^ h(fci, •.., Iki) 

• [kmin ■>^max\ki : \krnin ^max\ ^0 

11^11^1 ||0|liJl,2(x c ) T "y ^ I(ki, . . . , /fc 4 ). 

ki • [^min )^max] 

The hrst term on the last line is small enough thanks to (I2.12p . The second term can be 
eliminated altogether by refining the choice of A. Precisely, we set A(r) to be a piecewise 
smooth function which is constant for log 2 r £ K 2m and equals an appropriate constant 
multiple of r a+1 outside. Then it is easy to check that I (hi, ..., fc 4 ) = 0 if k 2 £ K m ; hence 
(12.lip follows. Furthermore, since m and the number of indices in K are bounded by Eq, it 
follows that A(r) « Eq r a+l as required. 


ll^ ) illi : f‘ T o+ 2 <e 0 2 ^ '||/fc|lij^o • (2-13) 


(c) For cr 0 = — 1, — | and f = fk we claim that 


In fact, a similar bound holds for any — 2 < cr 0 < 0 with 5 > 0 depending on a. By linearity, 
we may fix k, and by scaling (which leaves Eq invariant), we may assume that k = 0. Then 
the bound (I2.13P follows by applying (b) with — 2 < a < ao and cr 0 < cr < 0 to control the 
solution 0 in upper and lower Sobolev spaces, which implies that <pj decays in L 2 away from 

j = 0. □ 


We now continue the proof of part (2) of Proposition 12.11 From part 
lemma we obtain the estimate 


of the above 


ll^olli/i ^ ll^llit 1 ll^t^lU 2 ^5 Bun- 

Then, using the embedding H 1 C A 4 , we directly obtain the estimate (12.3p . 


(3)). Comparing E with Eu n we have 

E = E lin + 4nO(P0 || L2 + II V4,M + O(P0||i 2 + || VAoWb) 


therefore it suffices to establish the bounds 

P0IU 2 ~E lin e 3 , ||VA 0 || L 2 < Elin £4. (2.14) 

The hrst is easily obtained using the standard Littlewood-Paley trichotomy. For high-low 
interactions we have 


IIA'^fclU 2 2 fc j ||A j || i 2||0 A .|| ED , j > k 

For low-high interactions we have 

Finally for high-high interactions we have 

WPMMWv £ 
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3 < k. 

















In all cases we have favorable off-diagonal decay, so the l 2 dyadic summation for the output 
is inherited from A. Hence the first bound in (12.14 p follows. 

For the second bound in (12.14p we use the lemma to reduce it to 

WWM h-i S E lin e*. 

The argument for this is similar to the one above, and is left for the reader. □ 

3. Space-time function spaces 

3.1. The S 1 , N, Z and Y 1 spaces. We begin our discussion with the function spaces 
introduced in pT3j , namely S 1 for the MKG waves and N for the inhomogeneous 

terms in both the □ and the IZU equation. These are spaces of functions defined over all 
of R n+1 , together with the related spaces S and N*. They are all defined via their dyadic 
subspaces, with norms 

II« = £WIL xz { s , s \ n } 

k 

We recall the definition of their norms. With minor modifications at high 
follow [13]. For Nk we set 

N k = L 1 L 2 + X 1 0 ’" 1 , 

where 

M#* - (XhX>“ 2W iiA<wiiz 

k j 

The Nk norm is the same as in [13]. 

The S k space is a strengthened version of IV/, 

C S k c L°°L 2 n = N*, 

while S}„ is defined as 

/c 4k 

ll0IU; = l|V<#.|k+2-5||n^,|| iai2 +2-ir||n0|| i§i2 . (3.3) 

Compared to [13] we have loosened the P 1 summability of the n _1 L 2 L 2 norm and added the 
norm above. Both of these modihcations are of interest only at high modulations. 
The exact exponent 9/5 is not really important, for our purposes it only matters that it is 
less than two and greater than 5/3. 

Remark 3.1. In mi. yet another definition of the S 1 norm is employed, namely 

IHIsJ = ||V0|| 5fe + 2-i||n0|| L2L2 . 

Our justification for keeping the same notation S 1 (besides notational simplicity) is that the 
difference among these three definitions is minor. For a solution to MKG-CG, one can easily 
pass from one definition to another using the high modulation bounds in Propositions 14.41 
and 14.101 In particular, in every theorem stated in the introduction, statements with respect 
to one of these definitions of S 1 easily implies those with respect to others. 
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modulations, we 

(3.1) 

(3.2) 










We now recall the definition of the space Sk from [13]. The space Sk scales like free waves 
with L 2 x H initial data, and is dehned by 

Ml = Mir + II^H sr + IHIU > 

^oo 

where: 

11 ^ 11 si tr = sup 2 { * + r- 2)k \\((f),2- k d t (j))\\ L c IL r , 

2<q,r,<oo, ±+^<f 

uwiiy = ^2\\ F TQ<k+2i(/>\\%;(D with 1 = \ J -^i 

UJ 

The S k tr norm controls all admissible Strichartz norms on M 1+4 . The w-sum in the definition 
of S^J 1 is over a covering of S 3 by caps u of diameter 2 l with uniformly finite overlaps, 
and the symbols of Pf form a smooth partition of unity associated to this covering. The 
angular sector norm S k (l) combines the null frame space as in wave maps [25j [31] with 
additional square-summed norms over smaller radially directed blocks Ck'(l') of dimensions 
2 k> x ( 2 k ' +l, ) 3 . We first define 


||0IU“" 9 = sup IM|s“” 9 +2I , 

(3.4) 


W^Wne 


inf 



u u 


I L\ ,(L 

±LU' v 


\du' 




) 


su V \WJ\\ l ~ { l> ± ) , 

uj “ 


where the norms are with respect to £± = t ± uj • x and the transverse variable in the 
hyperplane (i.e., constant hyperplanes). Moreover, y^ denotes tangential derivaties on 
the (■£+)-*- hyperplane. As in [13], we set: 


\mi W) = iwi! r + 2- 2k m 2 NE+r 3k Y,\\Q ± <i>\\m-m 

± 

+ sup {WPCyil'Mlgr +2~ 2k \\Pc k ,(l')<i>\\ 2 NE 

k'^k,l %0 r n ,s v 
k+2Kk'+l'^k+l k ' y 1 

+ 2- 2 '='- fc ||f 4 ,(r,0||i 2 , 1 „ ) + 2~W + ‘"> V WQ^Pc^W , (3.5) 

± 

where the Ck'{l ') sum runs over a covering of M 4 by the blocks Ck'{l') with uniformly finite 
overlaps, and the symbols of Pc k ,{V) form an associated partition of unity. We also define the 
smaller space S| C Sk (see the bound (13.71) below) by 

IMU = IMk + UVulU-is. 

K 

On occasion we need to separate the two characteristic cones {r = ±|£|}. Thus we define 
the spaces N k .±, S k ± and N k± in an obvious fashion, so that 

N k = N ki+ n S* = S{, + + Sj,_, 
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N* k = N* kj+ + N k _ . 




Next we describe an auxiliary space of the type L 1 (L°°) which will be useful for decom¬ 
posing the nonlinearity: 

U\\z = ^ W\\z k , \\<t>\\ 2 z k = sup ^ 2^11^+2^1121^00) • 

7 1<C.C , . 

k oj 

Note that as defined this space already scales like H 1 free waves. In addition, note the 
following useful embedding which is a direct consequence of Bernstein’s inequality: 

□ _1 L 1 (L 2 ) C Z . (3.6) 

Finally, the function space for A 0 is simple to describe, since the A 0 equation is elliptic: 

IIA)||yi = ||VA)||W + ||VAo|| 2 2 ^ , 

where we recall that V denotes the full space-time gradient. 

Let E denote the linear energy space, i.e., 

E = H 1 x L 2 . 

One of the results in [13] asserts that we have linear solvability for the d’Alembertian in our 
setting. 

Proposition 3.2. We have the linear estimates 

l|v<A||s< II#)]IIe + IP0I|n , (3.7) 

IMI* ~ ' < 3 ' 8 > 

Here (j3.7[) is the embedding S'” C S, whereas (13.8p follows immediately from (13,7p . 

3.2. Interval localization. So far, we have described the global setting in [13]. However, 
in this article we work on compact time intervals, therefore we also need suitable interval 
localized function spaces. This is not straightforward, since our function spaces are defined 
using modulation localizations, which are nonlocal in time. To start with, we take the easy 
way out and define 

IHIsM/] = in f ||0 ||sh ||/lk[/] = inf ||/||n (3.9) 

4>=<t>\i f=f\i 

However, the next result allows us to simplify somewhat these definitions: 

Proposition 3.3. (1) Consider a time interval /, and its characteristic function xi- Then 
we have the bounds 

Ilx/0||s ^ Ms, WxiIWn < ll/H at, (3.10) 

The latter norm is also continuous as a function of I. We also have the linear estimates 

l|V0in m < MOIIIE + Iin^ik,], (3.11) 

PI| S .|/] Jdl0[O]l| E + IP0ll (K n iIA -i ni * # -S, [/r (3.12) 

(2) Consider any partition I = [jh- Then the N norm is interval divisible, i.e. 

Yi II/I&m ~ ll/llwn < 3 - 13 ) 
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and the S and S 1 norms are interval square summable, i.e. 


1101 


2 < 

S[I] ~ 


E imi 


2 

S[I k ] ’ 


IM& W s E IM&W 


< 

rv / , 

k 


(3.14) 


We remark that a consequence of part (1) is that, up to equivalent norms, we can re¬ 


place the arbitrary extensions in (j3.9j) by the zero extension in the N case, respectively by 
homogeneous waves with (0, d t (j)) as the data at each endpoint outside / in the S 1 case. 


Proof. (1)). It suffices to prove the desired bounds for frequency dyadic pieces of 0 and /. 

In the N case it also suffices to work with the space LfL 2 + X x ' 2 . But in this case this is 
exactly the proof of (158) in [23], where just steps 1 and 2 are needed. 


0 — 

By duality, we have the same bound for L°°L 2 D X^ 2 , which is a part of the S norm. We 
now consider the remaining parts of the S norm. The only difficulty is with the Sf n9 norm, 
due to the modulation localization. Fixing a modulation scale j = k + 21, we consider either 
Q%j{xi4>) or QfjiXif 1 )- There are two cases to consider: 

(i) Short intervals, |/| < 2~K Then 

Il<9o-(w0)lkr ^ ll<5< 7 (x/0)ll n i < < IHU °° l * 


X 


(3.15) 


(ii) Long intervals, |/| > 2 L Then we write 

QtjXifi = Q<j-3oXiQtj-2<i> + QtjiQ^-wXiQtj^fi) + QtjixiQtj - 2 0 ) 

For the first term we use the bound 

||Q<j-3O\ / rQ< J -20||5“" 9 ^ 11 Q <j—30Xl 11 L°° 11011 ^ 11011^ 

which was provec0 in [T7j, Lemma 7.1]. The other two terms in (13.151) are estimated in L 2 L 2 
as in (i): For the second term, we use 

||g+.(Q>,--3oX/g^_20)IU“- ~ 22 ||g > j-3OX/g< :? '-20lU 2 n 2 ~ 22 \\Q > j-mXl\\LA\Q<j-2 ( i ) \\L°°L? 

^ ||0||l°°l 2 - 

In the last inequality, we used the bound 

WQiXiW* ~ 2 - 2 , 

which follows from Plancherel in t and the fact that the Fourier transform of the \i is a 
suitable rescaling and modulation of sinr/r. Finally, the third term in (13.15(1 is treated as 
follows: 

\\QUxiQt 3 -2^)\\s: 7 < dWQti-Miw Z 1101k • 

The S 1 bound 


(2)). The N bound (13.13ft is exactly as in Proposition 5.4 in (159) in 


(13.14ft reduces easily to the corresponding S bound. The bound (13.14ft for the N* part of the 
S norm follows by duality from (13.13ft . Of the remaining components of the S part we have 


the same difficulty as in part (1) namely with the modulation localizations occurring in the 


Sf 19 norms. The solution is also the same as in part (1) precisely that for each modulation 
scale j we split the intervals into short and long, and estimate the two contributions as above: 


^Technically speaking, H2 Lemma 7.1] is stated for \i which decays in space, but we may simply ap¬ 
proximate xi by smooth compactly supported functions. 
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(i) Short intervals, \I m \ < 2 _jf . Then the modulation localization operator Q+j can cause 
significant overlapping of outputs coming from inputs in different intervals I m . Hence our 
strategy is to harmlessly discard Qf-j as follows: 

WQtjC^Xim&Wliy ^ 2a || < Y 

m mm 

(ii) Long intervals, \I m \ > 2 _J . Then for each m we use a partition of unity adapted to I m 
to write 

1 =Xi m +Yx l im 

i> o 

where Xi m is a smooth cutoff selecting a 2~i neighborhood of J m , while x\ m select the region 
at distance 2 _J+/ from I m . Correspondingly, we write 

Qtj Y Xl ^ = Yl XlmQtj(Xlm<P) + Y Yl X l I rn Qtj(Xlm<P) 

m m /> 0 m 

Now we estimate each of the sums above. For the first one we use the fact that the bump 
functions xi m have finite overlapping to write 


Y, XlmQtjiXlmfyWs ™ 9 ~ Y< WXlmQtjiXlm^Ws^ ~ ^ 11 Q<j ( Xim 4>) 11 S™? 


which suffices thanks to part (1) On the other hand, in the second sum, for each l we 
have at most 2 l overlapping bump functions. So we obtain 

II Y III Tf ~ 2 l Y, \\x\ m Q<j(Xl m 4>)\\l^> ~ 2 i 2 J Y, Wx'lmQtjiXlm^Wl 2 


< 


2 1 Y 2 ~ 2NI \\X 


Im<P\\L™L 2 


which again suffices. Here, at the last stage, we have used the fact that the operator 
Xj m Q<jXJm h &s a 2~i2~ Nl norm from L°°L 2 to L 2 L 2 , which is due to the separation of 
supports of the two cutoff functions. □ 


Last but not least, we consider the effect of extension on some of our Strichartz or energy 
dispersed norms; the role of these norms in our work will be explained in Section 13.31 For an 
interval I we denote by Xi a generalized cutoff function, which is adapted to the 2 k frequency 
scale: 

Xi(t) — (1 + 2 fc dist(t, I))~ N ■ 


For a function <fi in / we denote by f> e j xt its extension as homogeneous waves. Then we have: 


Proposition 3.4. Assume that |/| >2 k . Then the following estimates hold for 0/ localized 
at frequency 2 k : 

h‘(4 , r , ,2-‘<wr‘)ii „„ (3.i6) 

Xl{ t )\\‘PT t { t )\\ED ^||0/||eD[7]. (3.IT) 

where (p, q ) is any pair of admissible Strichartz exponents on M 1+4 . 
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Proof. By rescaling, we can take k = 0. It suffices to consider the case when / has the 
minimal length, i.e., |/| = 1 ; the general case is then easily recovered by applying the same 
proof to unit-length intervals at each end of I. By translation invariance, we may take 
1 = [ 0 , !]• 

We first consider the bound (13. 16|) . It suffices to show that for any Strichartz norm L p L q 
and 0 / localized at frequency 1 we have 

\\x°i(^T t ^t^T t )\\LPLv < ||0j||lpl?[j] + ||D0/||l 2 [/]■ (3.18) 

By the inhomogeneous Strichartz estimates, this reduces to the case when D0 = 0. 

We prove this in two steps. First we notice that the Cauchy data at time 0 satisfies 


l|0/[O]||z,8 < ||0/||iPL«[i] (3.19) 

Equivalently, we have to show that for functions f± localized at frequency 1 we have 

ll/±l|i. <l|e‘ ,|D| / + +e-‘ ,|D| /_|| t , ([0 , 1]il , ) 

We may easily fold t 1 G [0,1/3] and t 2 G [2/3,1] such that the L q norm of e^WI j + _|_ e -*hl- D ly_ 
is bounded by the right hand side. Then the desired conclusion follows from the linear 
independence of the symbols e~ ltj ^) for j = 1 , 2 . 

Secondly, we have the bound 

ll(0,M(f)|| w <(l + t 8 )H||(0,a t 0)(O)|| Lg , 2 < q < oo (3.20) 


This is trivial for q — 2. In the case q = oo, for a fixed (t,x) G M 1+4 , by finite speed of 
propagation we may truncate the initial data for 0[O] outside a ball of radius C(l + £) without 
changing (0, <9 t 0)(t, a;). Then the desired bound for |(0, d t (p)(t, x)| follows from Holder’s 
inequality (to control the energy with the L°° norm), the energy estimate and Bernstein’s 
inequality. 

Putting together (j3. 19(1 and (I3.20P we obtain (13.18[) . Finally, the bound (I3.17P follows from 
(13. 20p with q = oo. 


□ 


3.3. Smallness: energy dispersion and divisibility. Since our goal is to work with large 
data MKG-CG solutions, it is crucial to have at our disposal sufficient tools to gain smallness 
in appropriate settings. One such source of smallness in this article is the energy dispersion , 
which is used as an a-priori bound. Another venue for gaining smallness is to partition the 
time in finitely many subintervals, on each of which the norm is small. A space-time norm 
for which this procedure works is said to be divisible. In this short subsection, we provide 
heuristic explanation of both concepts and their use in our context. 

We start by discussing the use of energy dispersion as a source of smallness. For applica¬ 
tion, it is useful to quantify the smallness of the energy dispersion norm || • \\ed in comparison 
with the norm || • H 51 , which is stronger and have the same scaling. We therefore define: 

Definition 3.5. For any interval /CM and e > 0, we say that 0 G S 1 [/] is e-energy 
dispersed (with respect to the S 1 norm) if 

||0||bd(j) < elMIsq/p 
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(3.21) 














Observe that the S 1 norm is stronger than the ED norm by Bernstein’s inequality, i.e., 


1101 \ED(I) ^ ||V0|Uoo i2[J] < 11 0 11 51 [/]• 

Hence the dimensionless quantity e > 0 can be thought of as measuring the improvement 
relative to Bernstein’s inequality. 

Roughly speaking, small energy dispersion improves balanced frequency bilinear inter¬ 
actions. In [23], this improvement was obtained by interpolating the ED norm with the 
Wolff-Tao bilinear estimate in L P L P with p < 2. I 11 the present setting, as we have stronger 
dispersion due to higher dimensionality, we can achieve the same end by simply interpolating 
the ED norm with Strichartz norms. Indeed, the following linear lemma covers essentially 
all of our usage of small energy dispersion: 

Lemma 3.6. Let 0 G -S' 1 [/] be e-energy dispersed. Then for any k G Z and any non-sharp 
pair of Strichartz exponents (p, q ) G [2, 00 ] (i.e., “ + | < § & n d p f 2), we have 

sup ||(P fe 0,2~ fe P fc <%0)|| LPL q 7 ] < e 5 l 2 ( 2 _ p _ 5 )A! ||0|| s .i [/] , 

k 

where = 5±(p, q) > 0 . 

As discussed, this lemma easily follows by interpolating the ED norm with the Sf r com¬ 
ponent of the S 1 norm, which is possible thanks to the non-sharpness of (q, r ). We will often 
combine this lemma with Holder’s inequality to gain smallness for multilinear estimates. 

We now turn to the use of divisibility in our work. The bound (13.1 3ft shows that the N 
norm is divisible. However, the S 1 norm is notQ divisible, and this is a source of trouble. 
Our workaround is to introduce a weaker norm, denoted DS 1 , which collects a subset of the 
components of the S 1 norm which are divisible. This is defined as follows: 

11 011 DS 1 [/] = ||(|P|®0> \D\~sd t (j))\\ L 2 L6[I] + ||(| D |®0, |P| _ 6^(/>)|| il0L 30 [j] + \\\D\~2Du\\ L 2 L 2 {I] . 

(3.22) 

Precisely, we may include here any divisible Strichartz norm as long as we stay away from 
the L°°L 2 endpoint (i.e., the energy). To gain divisibility for Ho, we use the norm 

— ||(Ao,^Ao)|| L2 ^3 xi 2-[ /] , 

which is a divisible component of the Y 1 norm. 

By Proposition 13.41 we see that the homogeneous wave extension (iff 1 of a function 0/ in 
/ obeys the bound 

\\x k i<t> e i xt \\DS' ^ 110/1 Ids 1 !/] (3.23) 

when 0 / is localized at frequency 2 k and |/| > 2~ k . 

Our strategy will be to use as much as possible the divisible norms (such as DS 1 or L 2 H 2 ) 
in our bilinear and multilinear estimates, and try to prove smallness for the remainder. 


7 See however the result in Theorem I5.l|l5)| 
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4. The decomposition of the nonlinearity 


Recalling the definition of the currents J a = — Im(0.D a 0) we write the MKG-CG system 
again here as: 

□A, = ViJ x , (4.1a) 

□ a0 = 0 (4-lb) 

The second equation also includes A 0 and d t A 0 , which are obtained from the elliptic equations 

AA 0 = J 0 , A d t A 0 = W, (4-2) 

We now discuss the bounds for each of the components of MKG equation. For the purpose 
of this section, all analysis is done in a fixed time interval / = [0, T]. 

We remark that for the most part, bilinear and trilinear estimates for the nonlinearities 
were already proved in [13] in the context of the small data problem. Our goal here is 
to understand when and how we can regain smallness in the study of the large data. As 
discussed in Section 13.31 there are two such sources of smallness: 

a) Arising from norm divisibility for either A or 0, where a large but divisible norm is 
made small by selecting a suitable time interval partition. Here we seek to use the DS 1 part 
of the S 1 norm to measure the bulk of the nonlinearities. 

b) Arising from small energy dispersion for 0. This is often considered coupled with the 
additional high modulation bound 

<£ Sl Ws> (4.3) 

which for MKG-CG solutions is an easy consequence of the e-energy dispersion; see Theo¬ 
rem 15.21 below. 

Two easy ways to gain the two types of estimates above is by using suitable Strichartz 
estimates. Precisely, for divisibility we need L p L q norms with p < oo. On the other hand 
for energy dispersion we need p > 2, as well as non-sharp pairs of exponents ( p,q ), so that 
Lemma 13.61 is applicable. Often we can fulfill both at once, and prove the two types of 
estimates simultaneously. 

4.1. The terms A x . We decompose A, into a free and a nonlinear component, 

Ai = A{ ree + A;f 

where 

OA{ ree = 0, A{ ree [ 0] = Ai[0] 

and 

UA? = ViJ x , Af[ 0 ] = 0 

Given the expression of the currents J a = — Im(0<9 a 0) + A Q |0| 2 , we will think of A™ 1 given 
by the above equation as a multilinear expression in 0 and A, i.e., A™ 1 = A00,0, A). We 
can also extend this to a symmetric quadratic form in the first two variables, Aj(0 1( 0 2 , A). 
We also split it into a quadratic and a cubic part, 

A001, 02, A) =A'(01, 02 ) + A^(01, 02, A) 

— — 1 ^ > i (01<9 x 02 + <9x0102) + ~GI 1 T > i (0102 A x + 0!02 A x ) , 


20 








where □ 1 / denotes the solution to the inhomogeneous wave equation Du = f with u[0] = 0. 
The N bounds we need for DA X are as follows: 

Proposition 4.1. Let 0 i, 02 , A be test functions defined on a time interval I containing 0. 

(1) For all admissible frequency envelopes c, d, e we have 

I|DA^(01,0 2 )||iv cd [i-] + ||VA^(0i, 0 2 )||s cd [/] < 110i11s*[/]110211[/] (4-4) 

respectively 

IPA x (0i, 02, ^)^L l L 2 cde [i\ + ||VA^(0i, 02, A)\\ Scde [i] < 110i 11 [j] 110211 zjs'I [/] 11A11 [j-] (4.5) 

(2) Further, for each m > 0 there is a decomposition 

A x (01, 02) = A^ sma ;00i, 02) + A^ arfle (0i, 02) 


so that we have 

II small (01,0 2 )||lV cd [J] + || ^ ^x, small (01, 02)||s cd [/] $ 2 Cm 1101 11[/] 1102 11 S\ [/] (4-6) 

respectively 

IPA^ arse (0i,0 2 )||iv cd [/] + l|VA^ ar9e (0i,0 2 )||5 cd [7] $ 2Cm ||0i||ns:i[/|||0 2 ||DSi[i] (4-7) 

(3) In addition, if <f\ is e-energy dispersed and satisfies (14.31) then 

IPA X (01,0 2 )|U C [J] + l|VA^(0i,0 2 )||5 c [i] < 110111S' 1 [/] 11<0211S'![i-] (4.8) 


and 

IPA x (0i, 02, A)||jv de [7] + IIVA^( 0 i, 02 , A)|| Sde [j] < P 1 || 0 i|| 5 i[/]|| 02 || s i[/]||A|| 5 i[ / ] (4.9) 

Remark 4.2. The ||V(-)||s norm bounds follow immediately from the control of ||□(•)||jv 
thanks to (13.lip and the fact that the initial data vanish for A 2 , A$ As we see from (j3.3[) . this 
norm is slightly weaker than the main ‘solution norm’ S 1 for high modulations; nevertheless 
these bounds will prove useful in the proof of the multilinear estimates in Section 0 

Remark 4.3. Given a test function 0! on / which is e-energy dispersed and obeys (14.31) . 
the bounds (14.8[) and (14.91) still hold with the same right hand sides if we replace 0i by its 
frequency projection ( e -g-, P<k * 0! or -P>fc*0i) on the left hand side. This fact will be evident 
from the proof. The same remark applies to all the other estimates in this section that rely 
on e-energy dispersion. 

This proposition is proved in Section [71 

We also state high modulation bounds for DA X , which do not require a null structure nor 
an extra decomposition: 


Proposition 4.4. Let 0 1; 0 2 , A be test functions defined on a time interval I containing 0. 
For all admissible frequency envelopes c, d, e we have 

IPA x (0i, 0 2 )||( L 2 i j-^ nL § i j-|) cd [ J ] 

IPA X (01, 02, A)|| (i 2 H-h nL ZH-V) de [I] 

In addition, if i is e-energy dispersed, then 

IPA X (01, < ^ 2 )ll( L 2 i j-3 nL § J j-|) d [/] 

IPA.(01, PP)H(L 2 if-pL§if-3) de [/] 


~ 1101 1 DSl [/] 110211 DSj [/] 

(4.10) 

($1101 1 DSl [/] H02 ILo5j[/] P| DSi[7] 

(4.11) 

$P 1 ||0l||s 1 [7]ll0 2 ||si[/] 

(4.12) 

$ e5l ||0l||s 1 [/]ll0 2 ||si[/]||A|| S i[/] 

(4.13) 
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Proof. The whole proposition is a simple consequence of Bernstein’s inequality, non-sharp 
Strichartz estimates and Lemma [3761 For instance, the L 2 H~ 2 norm of the Littlewood-Paley 
piece P k OA 2 ((j) kl , f> k2 ) (where <j> ki is a shorthand for P ki (f>i) is bounded as follows: 

\\PkOAlf(j) kl , (pk 2 )\\ ■ 


<|| j Pfc(0fe 1 ^0 fe2 )|| ■ 1 + II P k {d x <t> kl <i> k , 


<2~ 5 max {|& — ki\} n iii^i 


2ju L 2 H-?[I] 


*= 1,2 


16 0Z-. || . 64 

'L a L'K[I) 


where the off-diagonal gain arises from applying Bernstein’s inequality to the lowest fre¬ 
quency. As (4, fj-) is a non-sharp Strichartz estimate, this bound suffices for both (14. 10)1 and 
(I4.12p (via Lemma [3761) . Similarly, for P k \DAl((j) kl , (p k2 , A ks ) (where A k3 = P k3 A), we have 


l-ffcl—IA x {(j> kl , 0fc 2 , Afc 3 ) 11^2^—^ Jjj 


<2- im “ ( l‘- fc l}( n ll|0|hbllL«i.[,])ll|B| J A,llL»i.[/] 


*= 1,2 


9 • 4 

The argument for the norm is analogous. 


□ 


4.2. The term Aq. Here we consider bounds for both Aq and its time derivative, which are 
given by (14.2(1 . The first equation can be written in a more explicit form as 

6| 2 )A 0 = Im (0<%0) (4.14) 


(—A 


which was analyzed earlier in Lemma 12.21 As an immediate corollary of Lemma 12.21 we 
obtain the following estimate for A 0 : 

12 


W A oWl^H\[I] + Po| 




UWkm ■ 


(4.16) 


Given this bound, we return to the equations ((4.2(1 and view them simply as Laplace equa¬ 
tions, whose solutions are quadratic expressions in 0, 


which are given by 


Aq — A o (0,0, A) d t A 0 — <9(A O (0,0, A) 

Ao(0, 0, A 0 ) =Aq(0, 0) + Aq(0, 0, A 0 ) 

= - A _1 Im(0<9 t 0) + A _1 (00A o ), 

0 O A O (0,0, A x ) =d 0 A q(0, 0) + 0 o Aq(0, 0, A x ) 

= — A _1 0 J Im(00,-0) + A ~ 1 <9 J (00A ( ). 

We also extend these to symmetric quadratic forms in the first two variables 0i,02- Our 
estimates for A 0 and <9oA 0 are as follows: 

Proposition 4.5. Let 0i, 02, A be test functions defined on a time interval I. Let c,d,e be 
admissible frequency envelopes. 

(1) For any exponent 2 < p < 00 , we have 


I|Ao(0i,0 2 )|| 
|Aq(01, 02, A 0 )|| 


cd 


LPH cd p [I] 


■ i+c. 


< II ( / ) i||dS1[/]II ( / ) 2||£)51[/] 

~ ||0i||dS1[/]||02||dS^[/]I|Ao| 


(4.16) 


LphI +p [I] 


22 









(4.17) 


l|d(Ao(»)'l,»/'2)|| 1 IS II‘S'i||dS1[/]II02||dS c 1 [/] 

1:1 cd L J 

||ftAo(0i,02,^)|| .i ~ 110111DS* [7] 110211[/] 11 11D5J [/] • 

LPH cde il J 

(A) In addition, if fi i is e-energy dispersed then 

||A O (01, 02, A 0 )|| 1+1 < £ <5l ||0 1 ||sl[/]||02||s c 1 [7](l + Po|l L ^l + i 

c N l ‘ j (4.18) 

||^A o (0i,02, Ax)\\ Lp ^r^ < £ 1 ||0l||s 1 [/]||02||si[7](l + llAJsit/]) . 

We omit the proof, as it is similar to Proposition 14.41 

4.3. The 0 equation. We will split the 0 equation into a leading order paradifferential 
approximation plus a perturbative part. The paradifferential approximation is given by 

□7* = n + 2 iJ2P<k-mA a d a P k . (4.19) 

k 

Here we retain the freedom to choose m arbitrarily large later on. Then the operator IZU is 
written as 

□ A = + Ma (4.20) 

where M% = Mff 2 + is given by 

M = 27 J2 P>k-mA a d a P k 0 - ld t A 0 fi , 

k (4.21) 

=A a B a ^ . 

The operator A 4™ will play a perturbative role in our analysis, just based on the S 1 and 
L 2 H 2 bounds for the coefficients A x , VA 0 . Precisely, for its quadratic and cubic parts we 
have: 

Proposition 4.6. Let A,B,fi> be test functions defined on a time interval I. Let c,d,e be 
admissible frequency envelopes. 

(1) The cubic part M.a’b satisfies the bound 

IIAO’lliwH £ 2^||(A. V4,)|| (MlxI , # . )c[I] ||(B„ VB„)|| (M1><124 (4.22) 
where ||(4, VAOII^,^^ » a shorthand for (||^T a; ||z?s r i[/] + || VA 0 ||^ 2 ^i ^ ). 

(2) The quadratic part Ad™’ 2 admits a decomposition 


ju m ’ 2 - A// m ’ 2 4- A/f m ’ 2 

JVI A — JVI A, small + Jvl A,large 

(4.23) 

so that we have 


WM^smalMN^I] < 2" Cm ||(A x ,-VA O )|| (slxL2i/ i )c[J] ||0|| 5 i [ 7 ] 

(4.24) 

while 


< 2 C "1I(4, V4)|| (Dslxl ^ J)t|J| IW|| s , m . 

(4.25) 
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(3) Further, if if is e-energy dispersed and obeys 04.311 . then the quadratic and cubic parts of 
M™ satisfy 


\\M m /n*<v\ ~ (2°™^ ii Aiu.,,] + iivaii iss j i;] 


s^i] 


(4.26) 


'{S 1 xL 2 H^) c [I] II ^^°)H (S 1 xL 2 Hh) d [I] 


S fi7] • ( 4 - 27 ) 

m,2 


as well as 

\\M^\\ Ncdm <2 Cm E s ‘UA I ,^A 0 )\\ 

This result is proved in Section [0 We remark the different roles of A d 'a small versus 
Mff 'iarge- The first one is small, and thus directly perturbative. The second is not small, 
but is instead estimated using only a divisible norm of A\ thus we can partition time into 
finitely many intervals where it is small. 

Our next goal is to compare the operators and D^)” ee , where we use the convention 
A pee = 0. We define the bilinear operator Diff™-0 by 

Diff^ = nr - ° = 2* E p <k-mA Q d a P k fj. 

k 

Hence we have the decomposition 

= D^ee + Diff> 

For the last term, we no longer use only the S' 1 and L 2 H\ bounds for A x and VA 0 , but 
instead we rely on the fact that A x and A 0 come from the equations (14.1 all . (14.2p . Thus, we 
replace Diff™,^ with the multilinear operator 

Diff£(0,0, A) = P<k-m A Q (0, 0, A)d a P k . 

k 

As before, we extend this operator to a symmetric quadratic form in the first two inputs. 
For the multilinear operator Diff^(0, <f>, A)ip, we have the following estimates: 

Proposition 4.7. Let 4 >i,4>2 , be test functions on a time interval I containing 0. Let 
c, d, e be admissible frequency envelopes. 

(1) The quadratic and cubic parts of the operator DifF£(</>, <f, A) satisfy the bounds 
||DiffX’ 2 (01,0 2 )^||iV / [7] < 110111 S'* [7] 1102115^ [J] 11V’ 11 [I] 

respectively 


||DiflF^’ (0i, 0 2 , A)ip\\ Nf [i] < ||K, VA 0 ) 
where 


'(DS 1 xL 2 H^)[I] 


110111 DS^ [I] 110211 DS\ [7] 11011 SI [7] 


f (P) ^(fc) || C<k—m || C 2 || d<k—m || £ 2 . 

(2) Further, for each m > 0, Diff^’ 2 admits a decomposition 


DiffX’ 2 = Diff A [Lall + Diffuse 

satisfies a better bound, 

\m7lmaU^m\N c[ I] < 2- Cm ||0|| 2 Sl[/] ||0|U c Md 
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pm,2 


cm,2 


so that Diff 


m, 2 

A,small 


(4.28) 

(4.29) 

(4.30) 

(4.31) 


(4.32) 






while Diff A’Lrge estimated directly in a divisible norm, 

\miir g e^mN c [I] < ^MlswUhm (4.33) 

This result is proved in Section |8l Again, we remark that the large part is estimated using 
a divisible norm, which can be made small by subdividing the time interval. We also remark 
that here we are concerned with unbalanced frequency interactions, so the energy dispersion 
plays no role. 


For the gradient terms in we only have the following dyadic bound from [13J: 

Proposition 4.8. For a divergence free homogeneous wave A we have the dyadic bound 

WPkAidjPrfWn < \\P k A[0]\\ E \\P^\\ s i, for k<l. (4.34) 

Due to the lack of b 2 dyadic summation with respect to k in the above bound, the gradient 
terms in need to be treated in a nonperturbative manner. This issue was addressed 

in the small data case in [T3J by constructing a microlocal parametrix. Here we adopt the 
same strategy, but using a different source for the smallness, namely the frequency gap m: 

Theorem 4.9. Let \A p jf n be the paradifferential gauge-covariant wave operator defined on 
line (14.19k and suppose that □A-f ree = 0 with ||H-f ree [0] ||#i xi 2 < E. If m is sufficiently 
large, m>£ 1, then we have the linear bound: 

IMI* ^ II^OllU+IP^e^H^-innfA-!- ( 4 ' 35 ) 

Section [9] is devoted to the proof of this result. 

Finally, we end this section with estimates that are relevant for high modulation bounds 
for (f. As before, no null structure is necessary. 


Proposition 4.10. Let A,B,f be test functions defined on a time interval I. For all ad 
missible frequency envelopes c, d, e, we have 

II^CVll 


'(L 2 H~?nL% H~y) cd [r\ ^^o)|| (DS 1 xL 2 H?) c [I] 


DS][I] 


\L?H-lnLZH-V) cde {I) 


< 


IIK,VAo)|| 


(DS 1 xL 2 H^) c [I] 


x || (H x , VH 0 )|| (DSl><L2 ^i )d[7] 


For every m > 0, we also have the bound 

ll Diff ^ll(n 2 A-^nn§A-f) cd p] * 

with an implicit constant independent of m. 

In addition, iff is e-energy dispersed, then 

IIACVll 


{L 2 H~?nL%H~V) C [I] IK^, VA°)||( Sr i x jEj2 ^i) c [ X] 

11 Dhf a IKAz, ^o)ll( S i xL 2^^) c [ / ] 




'(L 2 H~?nLZ h~l) cd [/] IK^’ ^^°)ll(S 1 xL 2 h'2) t .[/] 


x II ^' B °)ll(SixL 2 h^) d [/] 


MdsHi] 
DS\[I] 

sm 

sm 

ll^lUdd 


(4.36) 

(4.37) 


(4.38) 

(4.39) 

(4.40) 

(4.41) 
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Proof. The proof is similar to Proposition 14.41 and 14.51 We sketch the case of the L 2 H 2 

9 * 4 

norm and leave the case of the L$H~9 norm (which is a simple variant) to the reader. 

Compared to the frequency dyadic estimates in the proof of Proposition 14.41 it suffices to 
note that the following estimates hold: 

where we omitted [/] and used the shorthands A k = P k A 0 , B k = P k B 0 and 0*. = P&0. 
The off-diagonal gain is again due to the freedom of choosing where to apply Bernstein’s 
inequality. Moreover, by interpolation with the Sobolev trace theorem, note that 


114 


Z, 11 , . 5 <- 


II v A* 


1 L 2 H 2 


Since (00, |) is a non-sharp Strichartz exponent, the above estimates suffice for both divisi¬ 
bility and e-energy dispersed bounds (via Lemma [376]) . □ 

5. The structure of finite S 1 norm MKG waves. 

Here we consider an MKG solution (A, 0) on a time interval / = [0, T], with finite S 1 
norm for (A x ,(j)). Our main result is an accurate characterization of such maps: 

Theorem 5.1. Let ( A,q i>) be an admissible C t PL 1 solution to the MKG system (II.6p in the 
Coulomb gauge (II .5(1 on the time interval I = [0, T\ which has energy E and S 1 norm F, 
i.e., ||(Aj, 0)|151 [7] < F. Let c be a frequency envelope for the initial data (A, 0)[O] in the 
energy space H 1 x L 2 . Then the following properties hold: 

(1) (Linear well-posedness for \Aa) The linear equation 

□a 0 = /, 0[O] = 0o 

is well-posed, with bounds 

<=. lb/4nllK + Ilfll . 1 *. * ( 5 . 1 ) 


Shil] 


jF ||0[O]|k + ll/ll(jvnL 2 h _ 2nLii4”i) t i[/] 


for any admissible frequency envelope d. 

(2) (Frequency envelope bound) The solution (A, 0) satisfies 

ll(4z,0)||si[/] Sf 1- 

(3) (Refined Maxwell field bounds) We have 

II OAi || . 1 9. 4 

11 211 (NnL 2 H~ ?nL-5H~y) c 2 [i] 


(5.2) 


<F 1, l|VA 0 ||yl [j] <F I- 

rdF 1, 


'(Ann 2 if - 2 nit H~y) c [i] 


(5.3) 

(5.4) 


(4) (Refined scalar field bounds) We have 

|| □^4/ree0|| 

and for each m > 0 the following paradijferential estimates hold: 

^ 2 Cm • < 5 - 5 ) 

(5) (Weak divisibility of S 1 norm) There exists a partition I = Uf=i h- with K < F 1 so that 

11(4 0)||s 1 [J fc ] 1 (5.6) 

where the implicit constant is C(E + E 2 ) times the constant in Theorem\4-9 
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Proof. As a preliminary step, we observe that from Lemma [2.21 (14. 150 . (14.16P and (14.17P we 
obtain the bound 


IIVA 




5-.p 1. 


(5.7) 


We remark that this bound will later be refined when we prove (3) 


(1)). We first prove well-posedness for the equation 

i>[o] = (ip o ,-0i) 


= /, 


together with the bound 


M* ~ WM\E e + 


(NnL 2 H~?nLy H~y) e 


(5.8) 


provided that m 1. This is done perturbatively, based on the similar result for UP 


Theorem 14.91 Using also (15.7p . we can split time into Of( 2 WQCrn 
ll^lbsq/n] + IIAxIbsq/n] + l|VA 0 || i2 ^i^j <f 
Then within each interval I n we write the equation above in the form 


'Afree 


m 


) intervals I n 

2~2 Cm 


so that 


' — Af ree 


= ~E>iff» + / = -(Di + Diff™" 5 + + / . 


By Propositions 14.71 and 14.101 all the terms on the right are perturbative in N D LrH 2 n 
LsH~9[I n ], so if m ^$>f 1 then within each such interval we can solve the above equation 
perturbatively. Reiterating, the global solvability along with (15.8p follows. We note that 
in this argument the free part of A is reinitialized in each interval /*.. The nonlinear part 
K l = A(<M, Ar) is also defined separately for each interval. 

To get the well-posedness for the CU equation, we repeat the above argument for the 
expression A4™. For the N norm, we apply Proposition 14.61 Then the small part is treated 
perturbatively by taking m 1, while for the large part we use again a time interval 
division in order to gain smallness. For the L 2 H ~2 nLsH~9 norm, we use Proposition 14. 101 
and rely on divisibility for smallness. 


(2)). The (p bound is a direct consequence of the bound (15. ip applied to (p. Then we get the 
, (I4.4p ~ fl4.5p (for the N norm) and (14.1011 - (14.Ill) (for the L 2 hA nL§i7 _ t 


A x bound from 
norm). 


(14.15P and Proposition 14.161 


(3)). The A x bound has been proved in (2), while the desired Aq estimate follows from 


(4)). For the N norm, the bound for □ ^(p is a consequence of the estimates (14.221) . (14.241) 
and (14.25p for the components of O^ 171 (p = —A4™. For transition to (P we use in addition 

the bounds (I4.28P and (14.291) . We can switch back from □ p Jjl ee (p t° 0 A f ree (p using again the 
estimates (14.221) . (I4.24p and (I4.25P but for A = A^ ree . Finally, for the L 2 H 
bound, we use Proposition 14.101 for all parts. 


1 9 * 4 

2 n l$h 9 


(5)). By Proposition 12.11 and conservation of energy, the linear energy Eu n (A x [t], cp[t}) is 
bounded by E + E 2 uniformly in time. Moreover, the N norm is divisible by (13.13p . hence 
the A part is a direct consequence of (15.31) . The similar assertion for <p follows similarly from 
the divisibility of the N norm and the second bound (15.5p . since for a fixed m E, the 
□^™: ee equation is well-posed in S 1 with implicit constants depending only on E. □ 
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With Theorem 15.11 in hand, we may easily prove the continuation and scattering theorem 
(Theorem 11.61) . 

Proof of Theorerri Al.fA We start with the continuation result. The idea is to use the frequency 
envelope bound in Theorem 15.II to show a uniform lower bound on the energy concentration 
scale r c for all t G /, which allows us to apply Theorem 11.41 

By Theorem 15.11 we see that ( A , <f>) obeys the frequency envelope bound 

l|A)||yt,[/] + ||(Ae)0)||si[/] < F 

where \\c\\p < ||(^4^, 0)[O]||#i xi 2. In particular, Hindoo ||cfe||^2( fe> ^ = 0. Recall also that both 
Yf and S* control || V(-)||io°x,|- Hence given any small number 5 > 0, there exists £ G Z such 
that the splittings A = A iow + A high := A <e + A> £ and 0 = fi ow + 4> high := f <e + obey 

+ \\V(j>high(t)\\L 2 < 

and by Bernstein’s inequality, 

\\Afj,jow(t) ||l°° + II ||l°o + ||0Zou>(£) ||l°° + ||V0Zoto(£)||z,°° <F,C 1- 

Both bounds are uniform in t G I. Using Holder’s inequality for the low frequency part, we 
can find r = r(F, c, h) > 0 such that 

II (Ao ^iH At )(f)||( i ji nL 4) xZ/ 2( B .( a .)) + ||(0, ^t0(^)||(ijinL 4 )xL 2 (B f (a;)) < ^ 

for every t G 1 and ball B?(x) of radius r and arbitrary center x G M 4 . Recalling the 
definition (11.111) . we see that the energy concentrations scale of the data for (A, <f>) at time 
t is uniformly bounded below by r > 0, if 8 > 0 is chosen sufficiently small depending only 
on E. Hence by Theorem 11.41 (H, 0) can be continued past the endpoints of / as an CtH 1 
admissible solution with appropriate S 1 and Y 1 bounds. 

The scattering statement is an easy consequence of (15.31) . (15.41) . and divisibility of the 
N fl L 2 H~^ D L§i7 _ l norm. □ 

5.1. MKG waves with small energy dispersion. Here we continue the analysis above, 
but add to it the small energy dispersion condition. 


Theorem 5.2. Let (H, 0) be an admissible C t TL l solution to the MKG system (11.61) in the 
Coulomb gauge (11.5j) on the time interval I = [0, T], which has energy E and S 1 norm F. 
Suppose furthermore that 0 is e-energy dispersed. Then the following properties hold: 


(1) (Elliptic bounds) We have 


1 VH 0 | yi[/] < F £ 51 ||0|||i 

(5.9) 

(2) (High modulation bound) 



(5.10) 

(3) (Maxwell field bounds) We have 


HAz llsRJ] + 11 ^ Ar 11 ^ NnL2 ^- 2nzJi/-i)[/] £ 1 His 1 

(5.11) 

(4) (Scalar field bounds) Form > 0 we have 


warn , ■ i 9- 4 r < F 2 cm £ 5i |uii 5 i 

11 A YU (NnL 2 H~?rL'5H~v)[i] imis 

(5.12) 
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Proof. (1)). The bound (15. 9ft follows directly from the estimate (14.18I) . 


m- 

HJJ. 

El). 


These bounds follow from (14.39^ - (14.411) . 

The estimate (15. lip follows from (I4.8|) - (I4.9D . (I4.12[) - (I4.131) for HA™ 1 and (13. 121) . 
The bound (j5.12[) is a consequence of (14. 26H - (14.2711 . (14.391) and (I4.4ip . 


□ 


6 . Induction on energy 

Here we provide the induction on energy argument which gives the proof of our main result 
in Theorem 11.51 Our induction hypothesis is that the conclusion of the theorem holds up to 
energy E. Thus we have F(E) and e(E). Our goal is to show that there exists c 0 = c 0 (E ) > 0 
so that the conclusion holds up to energy E + c 0 . Moreover, we do not allow c 0 (E ) to depend 
on F(E), but only on E. The independence of Cq(E) on F(E) allows us to additionally 
ensure that co(-) is a positive non-increasing function on the whole [0, oo); this property is 
what makes our induction argument work for all energiefl 

To begin with, we observe that it suffices to establish Theorem 11.51 for smooth solutions. 
Indeed, Theorem 11.41 implies that any admissible CtH 1 solution ( A,q !>) can be approximated 
by smooth solutions in the S 1 [J] norm (and hence also in the ED[J} norm) for any compact 
interval J. Thus, we consider smooth data (AJO], 0[O]) with energy E + c 0 , generating a 
smooth solution (A, 0) in [0, T ) with ||0||_ed(o,t) — e ^e 1- Then the S 1 norm ||(Ha,, 0)||si(o,t) 
is a continuous function of time t G (0, T), satisfying 

jimll^^lU 1 ^) < E 1/2 . 

Hence, in order to prove a uniform bound 

II (4s, 0)IU 1 (o,q < F (6.1) 

we can make the bootstrap assumption 

11(^,0)1151(0,0 < 2F, (6.2) 

where F is a positive to be determined in the proof. By scaling we harmlessly take t = T. 

Indeed, once we show that (16.Tj) holds assuming (16.21) . a simple continuous induction 
argument in time implies that S' 1 (0,T) norm of (Aj.,0) is bounded by F. This bound is 
precisely (11.131) with F(E + c 0 ) = F. Note that the parameter e becomes e(E + c 0 ) in 
Theorem 11.51 

Next, we dispense the easy case when the S 1 norm of 0 is disproportionally small compared 
to the overall energy E of (A, 0). This procedure allows us to link the small energy dispersion 
assumption ||0 ||ed(o,t) < e to the notion of e-energy dispersion (Deffiiition 13.5ft for some 
£ = e(e). More precisely, given e > 0 to be determined, we consider two cases: (i) ||0||si(o,T) < 
eE 1 / 2 or (ii) ||0||si(o,T) > sE 1 / 2 . In case (i), a direct application of (j4.4D - (14.5j) . (14. 10[) - (14. 11 j) 
and the bootstrap assumption (j6.2[) gives 

\\nA x \\ , . i 9 . 4 < £ 2 E(1 + F). 

11 x "NnL 2 H~?nL-5H~V(0,T) ~ v ’ 

Applying the linear estimate (13. 8 p and taking e sufficiently small compared to to F, (16.ip 
follows directly. Thus we are left with case (ii), in which we may assume that 0 is e-energy 

8 We refer to the beginning of Step 2.3 in the proof of Proposition 16.II for the precise dependence of Co on 
E. The conclusion is that Co needs to be chosen small enough compared to the constant in Theorem 14.91 
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dispersed (according to Definition 13.5j) by taking e = e 2 E 1 f 2 . Henceforth we eliminate e 
(which has the dimension of ( energy ) 1 / 2 ) in favor of the dimensionless parameter e. 

To establish the S 1 bound (16 .1 ]) under the assumption that 0 is e-energy dispersed on 
[0, T], we will compare the solution (A, 0) with the MKG wave (A, 0) generated by frequency 
truncated data 

where the cut frequency k* G M (hence P<k* is a continuous version of Littlewood-Paley 
projection) is selected so that (A, 0) has energy E. Note here that we only truncate 0[O] 
and A } [0]. The functions A 0 and d t A (h which are also part of the energy, are defined directly 
from the compatibility conditions (II.7J) . The fact that such a k* exists is a consequence 
of the continuity with respect to k of the A 0 component generated by P<fc(A x [0], 0[O]), see 
Proposition 12.11 We further remark that by part (3) of Proposition 12.11 the energy of both 


(A, <f>) and (A, 0) is close to the corresponding linear energy of (A x [0], 0[O]), respectively 

(A x [ o], 0 [o]). 

We wish to apply the induction hypothesis to obtain an S 1 bound for (A, 0) on [0,T], 
namely 

H(4,0)llsW)<fW. < 6 - 3 ) 

For this purpose, we need to know that energy dispersion of 0 is sufficiently small on [0, T], 
We achieve this smallness by transferring the information for ||0||e_d(o,t) to ||0||£id(o,t) by 
another continuous induction in time. 

Indeed, at time t = 0, the solution (A, 0) has smooth data and ||0 [O]||£'_d < e 2 E 1 ^ 2 <C e(P). 
Thus for some short time it will still have energy dispersion < e(P). We claim that (A, 0) 
extends smoothly up to time T, so that the stronger bound 


\\ 4 >\\ ED { 0 , t 0 ) < 2 e ^ 


(6.4) 


holds for all t 0 E (0, T\. We will establish (16. 4p under the additional bootstrap assumption 

\\4>\\ED(0,to) < c(E) (6.5) 

As before, note that we may take to = T by scaling. 

To see how the claim follows from this bootstrap procedure, let T* be the maximal time 
T* < T up to which (|6.4[) holds. Then by our induction hypothesis and Theorem 11.61 
the solution (A, 0) extends smoothly past time T*. Hence (16.5j) holds past the time T* 
by continuity, therefore (I6.4j) also holds past time T* by our claim. This contradicts the 
maximality of T* unless T* = T. 

To summarize, we have to prove that we can find Co = Co(E), F ^>e 1 and £ Cp 1 so that 
the following statement holds: 

Proposition 6.1. Assume that the MKG waves (A, 0), respectively (A,0), with initial data 
(Ae[0], 0[O]) ; respectively (A X [O],0[O]) = P<fc*(A x [0], 0[O]) ; and energies E + Co, respectively 
E, are smooth in [0, T\ and obey the following hypotheses: 

(i) The S 1 norm o/(A x ,0) satisfies (I6.2|h 

(ii) The solution 0 is e-energy dispersed (as in Definition ^. 5\) . 

(Hi) The ED norms of 0 and 0 obey 

IHU_d(o,t) < e (E), ||0||ed(o,t) < e 2 E 1 ^. 
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Then the following statements hold: 

(1) The S 1 norm of(A x ,4>) satisfies (16.11) . 

(2) The ED norm of <p satisfies (16.41) . 

Proof. Unless otherwise stated, all norms below are taken over the time interval (0, T). We 
will prove the proposition in two steps. 

Step 1: The low frequency bound. Here we estimate the difference (R,0) given by 

B = A — A <k * , 0 = 0 - 0< fc * 

and prove that it satisfies the bound 

ll(B*.«lk. + l|VBoll j <f e‘% = (6.7) 

c* 

One consequence of the -0 bound above, combined with (16.61) . is that (16.41) holds. 

Before we begin, note that Theorem 15.21 implies the following a-priori bounds: 

<fE 5, IWs., (6 .8) 

IIVAoll^. <Fe s 'IW!., (6.9) 

In particular, the bound for 0 ensures that (14.31) holds, allowing us to apply Propositions 14.11 
and 14.61 whereas the bound for VA 0 provides smallness in applications of (14.8p . 

Step 1.1: Bound for B x . To prove the estimates above, we begin with the bounds for B x . 
By definition B x has zero Cauchy data at time 0, therefore we have 

B x = A x (0, 0, A) - P <k *A x ((j), 0, A) 

Ax(0, 0) A) Ax(0<fc*, 0<fc*; T A a; (0</ C *, 0</j*, A<k») P<fc*A ;c (0,0, A) 


In the first difference above we substitute A = A <k * + B and 0 = f) <k * + 0, then use (13.81) , 
(I4.4D - (I4.5D . (I4.8p - (l4.9p . (14. 10p ~ (j4.lip and (14.12p ~ (14.13p (see also Remark 1473]) to obtain 


l|A,(0,0,A) — Aa,(0 <fc »,0<fc*, A <fc *)|| s i t < F e Sl ||(-B x ,0)||si + ||(R c ,0)||!i + || ( B x , 

c c c * c - 

The second difference is localized at frequency < 2 k * +5 , and all the 0 factors are e-energy 
dispersed. Further, we may rewrite this difference as 

P<k *+5 ^Ax(0<fc*, 0<fc*) A <fc *) A 3 ,(0^fc*_5, 0<fc*— 5 , A< k *_ 5 )^ 

B<k* ^Ax(0, 0j A) A x (0</ c *_ 5, 0<fc*_5, A<fc*_ 5 )^ 

which shows that of the two or three inputs, at least one has frequency > 2 fc *~ 10 . This input 
can be measured with the frequency envelope c* at frequencies below 2 fc *~ 10 . Thus, applying 

(USD, dMD-il) and 6A2D-(|il3D yields 

||A a;(0<fc*, 0<fc* , A<fc» ) P<k* Aa;(0, 0, A) Ugl* & 

Summing up the last two bounds, we get 






































Step 1.2: Bound for Bq. The analysis for Bq is very similar. Precisely, Bq solves the 
equation 

B 0 = A o (0, 0, A) - P <k * A o (0, 0 , A) 

Ao(0,0, A) A o (0<fe*, 0<fc*, A<&*) T A o (0<fc*, 0<fc*; A <fc *) P<fc*Ao(0,0, A) 

and the terms on the right can be estimated using (I4.16|) - (I4.18I) . The same applies for 0*i?o- 
We obtain 


IV -Bn 


L 2 h' 


<F 


(e St + 11(^^)1110(1 + II (-Si. Oils' + IIVBo 


L 2 H^ 


( 6 . 11 ) 


Step 1.3: Bound for 0. We now consider 0, which solves 

□o0 = - ( n 4 - □A <fc *)0<fc* - (□A <fc «0<fc* - P<fc*CU0) 

2 rr-i 


( 6 . 12 ) 


We start by estimating the right hand side in (L 2 H ~2 n LvH~$) c *. For the first difference, 
we write A = A<£* + I? and observe that at least one input is B (which can be measured 
using c*) and 0<fc» is e-energy dispersed. Hence by (14.39h - fl4.4in . we have 

II(Da — n A <k *)4><k * 




<F S*( 


ISills;, + ll v Soll IJS j)(l + II Sells'. + ||VB„|| 4 ). 


(6.13) 


For the second difference in (16. 12jl . we claim that the following bound holds: 


II (E-U <fe »0<fc* — P< fc *nA0)|| 


(L 2 h”3nLnh _ 5) c 


<F ^(ll-^lls; 1 , + ||VB 0 ||^i )(1 + ll^llsy + ||VB 0 ||^i. 


(6.14) 


To prove this bound, we divide further into the following cases: 

(i) At least one of the A frequencies is > 2 k *~ 10 . Note that the output frequency is localized 
to < 2 fc * +5 . Hence by measuring the high frequency input with c*, using the ^-energy 
dispersion of 0 and applying (I4.39p - fl4.4ip . we can bound this contribution in (L 2 H~ 2 fl 

l!h~I) c * by < F e Sl . 

(ii) The term □A <fc ,_ lo 0<fc* - P< fe »([IU <fc *_ lo 0)- The contribution of 0< fc «_ 5 and 0> fc * +5 is 
zero, so we may assume that 0 and the output are frequency localized near 2 k *. Then 
by e-energy dispersion of 0 and (I4.39I) - (I4.41I) . the desired estimate follows. 

To estimate the N c * norm of the right hand side in (16.121) . we use a frequency gap parameter 
m to be chosen later. The first difference in (I6.12p is expressed in the form 

(□a - □A < „)0< fe * = [M m A -M m A<k J0< fc . + (Diff™(0,0, A) - Diff™(0,0, A))0 <fc * 

In the first term, note that one of the inputs must be B. Then we use the frequency envelope 
c* for B , the ^-energy dispersion of 0 <fc », (16.8p and (16.9p via (I4.26p - (l4.27p to obtain 

(£ Sl 2 Cm \\B x \\ sK + ||VBo 


II (M 


A4 4 .,,)0<fc*||7V c * 


<k* 


>o" I ^)( 1 + l|B I || S ' + l | V B oll L2ili ) 


(6.15) 

In the second term, we first replace the argument (0,0, A) by (0<fc*, 0<fc*, A<fc*), and estimate 
the corresponding difference via (I4.28I) - (I4.29I) as 

||(DifF£(0, 0, A) — Diff)£(0 <fe *, 0<fc*, A <fc »))0 <fc *||jv c , <f 2 - <5 ° m 
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where both the frequency envelope control of N c * and the gain 2~ s ° m come from the frequency 
gap between the difference of the magnetic coefficients A(0, 0, A) — A(0<fc», 0<fc», A<fc*) 
(which is only used at frequencies below 2 k ~ m ) and its arguments (0, 0, A) (of which at 
least one must have frequency no smaller than 2 k ; we use c* to measure this input). Then 
we are left to establish 

||(Diff™(0,0, A) - Diff^(0<fc», 0<fc«, A <k *))4> <k , ||^ 

<f 2- s »">(||(B I ,V-)|U.. + HVBoll . )(1 + (6 ' 17) 

c* 

Note that one of the inputs must be (5,0). Then (16.1711 follows again from (l4.28D - (l4.29p . 
and using the frequency envelope c* to measure (5,0). 

Finally, we still have the second difference in (I6.12p in N c *, for which we claim that 

111—0<fc* - 5< fc *D40lk* k s Sl 2 Cm + 2~ cm (6.18) 

To see this we write it as 


(□j4 <jfc *0<fc* - 5 <fc *D A 0) = (M^cjxk* - 5 <fc *A^0) + [P<k; Diff(?]0 

For both differences, note that the output frequency is localized to < 2 k * +b . Canceling the 
like terms in the first difference, we are left with three types of frequency scenarios: 

(i) The frequency of one of the A’s is at least k* — m. Then we can apply fl4.26D - fl4.27D and 
(16.9|) to obtain an e s ^2 Cm bound, where we use c* to measure the high frequency input. 

(ii) We have a quadratic term of the form rra0[fe*_5,fe*+5], which can be directly 

estimated by 2~ cm using Strichartz bounds and Bernstein’s inequality. 

(iii) We have a cubic term of the form 7l< fc *_m0[fc*-5,fc*+5], which in turn can be estimated 
directly in L 1 L 2 using non-sharp Strichartz estimates, to get an e Sl bound. 

It remains to consider the commutator term [5<a,«, Diff™]0; we claim that the contribution 
of this term can be estimated by 2~ cm . It is clear that only the frequencies close to k* in 0 
are relevant here, therefore the commutator can be expressed as 

[p<*-, -4?*._ jaw*- = 2 - k± L(VA% t ._ m , aw*-) 

for a bilinear form L with translation invariant integrable kernel. The A 0 term is easy to 
deal with using the L 2 H 2 bound for A 0 , Strichartz for 0 and Bernstein’s inequality. Thus 
we are left with the expression 2 ~ k *L(VA J <kt _ m , dj(j) k *), which has both a null structure and 
a favorable frequency balance. This we can treat using the bound (17.3|) in the beginning of 
the next section (see also (17.ID ). Hence (I6.18D follows. 

Thus, summing up all cases in (16.13D - fj6.18D . we obtain 


IIn i 0 || „. 1 9.4 ^ p 

11 Ar H(jvnL 2 H - 2 nz,siT - s) c » 

( 2 -“ + 2-‘‘ m +£ i ‘2°”'(l + ||(B,W)lls;.) + l|VB 0 || i2ft 0(1 + H(B,W)|||,. + HVBoll^, ) 

Optimizing the choice of m, this gives (with <5* -C <5o c) 


I n 4011 , „. 1 9.4 

I ''(NnL 2 H-?nLSH--5) c , 


<F 


( £ J -(1 + ||(fl*W)|| s ,.) + l|VB„|| , )(1 + ||(B I W)II|,. + l|VB„|| 2 , ) 
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Recalling that ip has zero initial data, by Theorem I5.1|fl) this implies the estimate 


\m 


<1 


F £*•(!+ 11(5,^)1151.) + ||VS 0 | 


L^Hl 


(1 + II (-^S) VO II 51 . + || V-B 0 


12 

l^hI 


(6.19) 


Now we can combine this with (16.101) and (16.111) . and close to prove (|6.7|) . provided that e is 
small enough. We carefully observe here that the smallness of e depends on F. In turn, we 
will want later that the choice of F is independent of e. 


Step 2: The high frequency bound. Here we consider the high frequency differences 

^high^high^ gj ven Py 

B hujh — A- A, ip high -a-(i 

and prove that they satisfy the S 1 bound 

¥B H is ,.^M a , L )hi + ||vb 0 || i1aJ < f{e) 1, (6.20) 

provided that cq = co(E) is chosen small enough compared to E, but independent of F(E). 

Step 2.1: Energy estimate for (Bp: l9h , ip hl9h ) and weak divisibility. Here we take 
the necessary steps to ensure the independence of Co on F(E). We first use the energy 
conservation for (A, <p) and (A, (p), together with the low frequency estimates of the previous 
step, to conclude that the energy norm for (B hl9h , ip hl9h ) stays bounded, i.e., 

||(VB“» k , < Q, + 0 F ( £ i J -) (6.21) 

Indeed, for each t e (0, T ) we have 

Eu n (A,(p) =En n (A>k* , 0>fc*) + Eii n {A < k*,(p<k*) + {A>k* 1 A <k *) iji xL 2 + (0>fc* , 4><k* ) ]jl x l 2 
^ Elin(A>k* j (p>k * ) T j 0<fc* ) • 

where we omitted writing [t] and the subscript x from A = A x . We have used the fact that 
the operator P <k *P>k* is non-negative, as it has a non-negative symbol. By Step 1, we know 
that cp <k *) is equal to (H, <p) up to an error of size Oi?(e <5 ‘) in Sp*. Therefore, we have 

E lin (B hi9 \iP hi9h ) =E Un {A> k *A>k*) + 0 F (e 5 *) 

<E lin (A, (p) — Eii n (A<k *, (p<k *) + Of(s s *) 

—Eu n (A, (p) - Ei in (A, (p) + O f (e 5 *). 


By Proposition 12.11 recall that Ei in (A,(p) and Ei in (A,(p ) are 0 F {s^*) close to the corre¬ 
sponding conserved energies S[A,(p] and £[H,0], respectively. Hence by the definition of 
(A,<p), the desired estimate (16.211) follows. 

Next, we use the weak divisibility of the S 1 norm in Theorem [5711(5) to split the time 
interval / into Of(b)( 1 ) subintervals, on each of which 


II (A x , <p)\ I S 1 [J] <E 1 . 


( 6 . 22 ) 


We remark that this bound also relies on the conservation of energy for (A,<p). Due to 
the uniform bound in (16.211) . on each such subinterval J we can reinitialize the data for 

(Bp: l9h , 'ip h ’ l9h p and we no longer have any trace of F(E) or e(E). Instead, (A x ,(p) has S 1 
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norm O e ( 1) and energy dispersion Cp£ S *. Thus, it remains to show that on each J we have 
the improved bound 

\\(B hi9h ,4 hi9h )\\ sl[J] < E 1 (6.23) 

Then (16.201) would follow by adding the above over Op(E)( 1) intervals, using (13.141) . 

Remark 6.2. Compared to the low frequency estimate in Step 1, here we have a key advantage 
that we can exploit small energy dispersion for both 0 and 'ijj ,ngh , albeit at the expense of 
using the S * 1 norm of 0 on the larger interval (0,T). More precisely, Step 1 implies 

||0 — 0<fc*||s 1 (O,T) + \\^ ht9h ~ 0>fc*||s 1 (O,T) < Cp£ 5 *- (6.24) 

for some constant Cp > 1. Moreover, 0 is e-energy dispersed by hypothesis and obeys (16.81) 
on the large interval (0, T). By Remark 14.31 and a simple extension procedure^, we may gain 
Cf£ S * + e &1 ||0||si( O ,T) from 0 or ^ high whenever any of the e-energy dispersion bounds (14.81) - 
(14.91) . (I4.12l) - (14.13[) . (14.18[) . (|4.26I) - (I4.27I) . (14.39D - (14.41 [) are applicable on the smaller interval 
J. 


Step 2.2: Bound for B^ gh . The bound for B x l9h is an easy consequence of small energy 
dispersion. Indeed, using (16. 241) . e-energy dispersion of 0, (16.81) as well as the estimates 
(I4.4D - 04.5I) . (14.8ft - fl4.9p . (I4.1()[) - (I4. Ill) and (I4.12D - (I4.13D (see also Remark [6721) . we have 

( 6 - 25 > 

where we used the fact that <5* <C 5. Then by the linear estimate (13. 8 p and (16.21)1 . it follows 
that 

l|S“ i "‘lls>M ^ CO + 0 F (ei s -) . (6.26) 

This bound is stronger than what we need for (16.231) . but it will be useful in the next step. 


Step 2.3: Bound for ^ high . For we claim that a similar bound to (16.25ft but with 

respect to the CU flow holds: 


\\nAib hi9h \ 


(6.27) 


where 0 < 5** 5*. Assuming that (I6.27P holds, we can conclude (16.23[) using the following 

simple additional bootstrap argument in time. Denoting the initial time in J by t 0 , it follows 
from (I6.2ip and continuity of the S ] [J'] norm that we have 


W hi 9 h \\sHJ>]<C E {c 0 + O F (ei 5 ')) 


(6.28) 


for a suitably large constant Cp > 1 and a sufficiently short interval J' containing t 0 . Then 
to prove (16.28ft for J' = J, it suffices to establish (16.28ft under the bootstrap assumption 


U hi9h \\s^j']<^C E (c 0 + O F (£-^)). 


(6.29) 


Choosing c 0 sufficiently small depending on E and e <^p 1, it follows from (16.21ft . (16.22ft . 
(16.26)1 and (j6.29)1 that 

Il(-4x, 0)||sl[J'] 1, 


9 Technically, one extends all non energy-dispersed inputs of the form 0, A x by homogeneous waves out¬ 


side J to (0, T) (see Proposition 13.31) and A 0 by a standard Sobolev extension so that ||VAo|| 


I V An 


L 2 n?( 0 ,T) 


< 
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where the implicit constant is twice that of ( 16.221) . Now applying the linear (ZU bonnd (15. ip 
in Theorem [5T1 1(1) and enlarging Ce to be larger than the implicit constant in (15.ip . the 
desired estimate (16.281) follows. We remark that the size of Co essentially depends on the 
implicit constant in (15. ip . which in turn depends on the constant in Theorem 14.91 

We now turn to the proof of (16.27p . We first estimate the L 2 H~ 2 n L$H~ 9 norm, which 
is easier. Note that 

a A ^ hl9h = (i=u - OO0. 

Exploiting the small energy dispersion of 0 as in Remark 16.21 and applying (14.36p - (14.38p . 
(I4.39l) - (l4.4ip . as well as (16.71) . (16. 9 1) for A 0 , A 0l we obtain 


\n A i/j hi9h 


'L 2 H~?rL% H~y[j] 


< C S * 

r^F ^ 


(6.30) 


To bound the N norm in (16.271) . we introduce a frequency gap m to be chosen later. Then 
we write the \3 A equation for ijj hu J h as follows: 

n A *P hieh = (□,, - a A )i (6.31) 

=(MJ - + (□$”« - □5”..)0 + (Diffj(0, ip, A) - DiffJ(*, <P, A))l 

where the decomposition A = A^ ree + A nl is performed with respect to some fixed initial 
time £ J- 

In the first term on the right hand side of (16.311) . we take advantage of the small energy 
dispersion of 0 as in Remark 15721 and apply (l4.22D - fl4.24p . (I4.26p ~ d4.27p . as well as (16. 7f) . (16.9p 
for Ao, A 0 (here it is crucial to use the smallness factor e Sl in (16. 9p ). to obtain 


N[J] <F 2 


Crn^S* 


In the term (IZP~™ ee — □]^™ eE )0, we make a further decomposition as follows: 


(□$£- - UjSL)* = (Mjf r 

k>k*+m 

For the first difference, we use the bounds (I4.24p ~ (l4.25[) and (I4.26p . where we exploit the 
small energy dispersion of 0 as in Remark 16.21 For the second difference, we use the null 
form estimate (I4.34p together with the high frequency decay of 0 and low frequency decay 
of A — A[t 0 ] due to the c* envelope bound (16.71) . We conclude that 




II 


m 

ree 


AT[J] 


2 Cm£.8* _|_ 2 —cm 


Finally, for the third term in (16.3R . we use (I4.28P and (14.29p . The gain comes from the 
low frequency bound (16.71) from Step 1. This guarantees that, on one hand, 0 decays at 
high frequencies > 2 k *, and on the other hand the differences (4 — A, 0 — 0) decay at low 
frequency < 2 k . As the frequency gap enforces a separation of at least m, from (I4.28p and 
(H~2§P we obtain 

||(Diflx4<M) - DiffXOM, ^))0lkw 2-*”. 

Summing up, the bounds for the three terms in D A ij; h ' l9h , we conclude that 


\U A ^ hi9h 


I ^ nCra.i5* I r\- 

'NnL 2 H~hnL^H~^[J] 


+ 2 


—Squi 


Optimizing the choice of m, the desired estimate (I6.27P follows. 


(6.32) 

□ 
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7. Bilinear null form estimates 


We begin our discussion with the bilinear null form estimates, which play a key role in our 
analysis. These occur in both equations in the MKG-CG system fl 1.6 f) . In the 0 equation we 
have the expression AWj(f, under the Coulomb gauge condition A = 0. We can rewrite 
this as 

A j drf = d k d k /Sr 1 A j d j (j) = Q kj {d k A~'A j , 0) (7.1) 

where Q k j is the standard null form 

Qkj(u,v ) = d k udjV — djudkV 

In the A equation, on the other hand, we encounter the expression 

Vj((j)d x 4>) = 4>dj(j) - d k d j A~ 1 ((j)d k 4>) = d k A~ 1 Q kj (q b, 0) (7.2) 

Thus, it suffices to produce good estimates for the null form Qij. For that we have 

Proposition 7.1. Let AT be one of the Q ^ null form. Then the following bilinear estimates 
hold: 

\\PjM{Mi)\\N < 2 J 2-^- fc M^I)||0 fe || 5l ||0 ; || 5l (7.3) 

||Q < ,_ m P J W(g <fc _ m 0 fc ,Q <z _ m 00|| iV < 2^2~ Sm 2 c ^~ k ^ k ~ 1 ^\\(j) k \\ s i ||0;||51 (7.4) 

The first estimate above is the bound (131) in |T3]. For the second bound we can harmlessly 
assume that |j — k\ + \k — l\ -C C. Then (17.41) is a consequence of the bound (143) in [ 13] . 

We remark that the first bound (17.31) easily transfers to an interval /. However, the second 
one involves modulation localizations, which are inconsistent with interval localizations. 

We now use the above null form estimates to conclude the proof of all the remaining results 
in Section [H except for Proposition 14.71 


Proof of Proposition f.l. As in Remark 14.21 it suffices to estimate the || □(-) || jv norms. We 


begin with the quadratic part Aj.. For simplicity, we concentrate on the case when the first 
two inputs are identical; the general case is a minor extension. We have 

□A; (0,0) = (9jA _1 A/’(0, 0) 

therefore (14.41) follows by dyadic summation from (17.31) . 

To prove the more refined bounds for we use a large frequency gap m to first split 

□A?(0,0)= P fe a j A- 1 W(0 fel ,0 fe2 )+ P k dj A- 1 W(0 fel ,0 fe2 ) 


max{|fc— ki\}>m 


\k—ki\<m 


The first sum is estimated using (17.31) with a 2” cm constant. 

The second sum is essentially diagonal, so it suffices to estimate it for fixed k. For this 
we consider two cases depending on the relative size of the interval /. The case of short 
intervals |/| < 2~ k+m is easy to dispense with, as we have 

||-Pfe5iA _1 A/’(0 fel , 0fe 2 )||iV[/] ^ 2 _fc ||A/’(0fe 1 ,0fe 2 )||L 1 L 2 [/] 

< 2 Cm (2 k \I\)^\\\D\^(j) kl ||i, 4 [/ ] |||-P|^0fc 2 |U 4 [/ ] - 
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Here we have a large 2 Cm constant, but the Strichartz norms on the right is divisible so this 
suffices for (14.7[) . Moreover, since (4,4) is a non-sharp pair of Strichartz exponents, it will 
be sufficient for (14. 8 p too, as we explain below. 

We are left with the most interesting case. To summarize, we have \k — kj\ < m and 
|/| > 2~ k+Cm . To continue the proof we need to use modulation localizations. In order to 
be able to do that we extend </> outside our interval / by homogeneous waves. Then we 
decompose 

P^OjA A/"(0fc 1 ,0fc 2 ) Q<k—CmPkdj A A/"(Q<fc—Cm0fci j Q <k—Cm < Pk2 ) t 6u(f, fi, b) 

where the error corresponds to at least one modulation larger than k — Cm. The first term 
is estimated using (17.41) with a 2~ cm constant. For the error we produce instead a direct 
bound, with two cases: 

(i) High modulation output: 

\\Q>k-CmPkdjA 1 A/"(0fc 1 , 4>k 2 )\\N[i] ^ 2 Cm 2 ~||X/A/"(</>fc 1 ,0fc 2 )IU 2 

<2 Cm \\ X )\D\i4. h \\ L 4x)\D\iM\L‘ 

<2 c '”|||D|i* 1 || 1 . [r] |||D|i^|| 1 . [r] 

where we used Proposition 13.41 on the last line. Here the relaxed cutoff ~Xj was inserted 
in order to account for the fact that the operator Q > k-Cm is nonlocal in time. Its kernel 
decays rapidly on the 2 Cm 2~ k time scale, and this is exactly the scale captured by X/- 
Again the Strichartz norms on the right are both divisible and non-sharp, so this bound 
suffices for both (14.7p and (|4.8[) (see below). 

(ii) One high modulation input: 

||Q<fc— CmPk-^{,Q>k—Cm4 > k\ j 0fc 2 ) II N[I] ^5 2 || (Q>k—Cm4 ) ki , ) || L X L 2 

< 2 Cm \\ 0(p kl 11£211 Xi<t>k 2 \\l 2 l°° 

This suffices for (14.7j) . To complete the proof of (I4.8j) we also need to account for the 
case when has high modulation. Then we have the following small variation of the 
previous computation: 

||Q<fc —CmPkAf ((f>ki , Q>k— Cm^fe) II N[I] 2 || (jf>k \, Q>k—Cm4 > k2 ) || L 1 !. 2 

<2 c n\x k M\ L i L JVK\\ L i L2 

where the point is that (|, oo) is a non-sharp Strichartz exponent. 

To conclude the proof of (14.8p we observe that the above estimates allow us to use the 
e-energy dispersion and (j4.3j) for all the large parts of A 2 X . Hence we obtain a bound of the 
form 

PA^.^Ik,,] < (2-“” + 2 c V')||*|| sl[I] ||^|| s , [;] 

Now (14.81) easily follows by optimizing the choice of m. 

Finally we consider the cubic terms A?, which satisfy 

□ A?(0,0,A b ) = P(#A*) 

At the dyadic level, using Bernstein’s inequality in a favorable way we obtain 

\\Pk(4 , k 1 4>k2^-k 3 )\\L 1 L 2 [I] ^2 imax l fc 111 -DI 3 11 x. 4 [Z] 111 | 3 11 11 11 Z.2Z.8 [X] 
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All norms on the right are Strichartz norms and are bounded by the S 1 norms, so (14.4ft follows. 
Further, if say 0i is £-energy dispersed, then we can bound its non-sharp Strichartz norm 
L 4 using the energy dispersion at the expense of losing the frequency envelope information, 
in order to obtain (14.911 . □ 


Proof of Proposition \ 4-6[ For the leading part 

k 


oiMJ we have 


A j dj(f) = A/"(9fcA 1 Aj, <f>) 


To decompose it into a small and a large part we first consider the frequency balance of the 
two inputs and the output, depending on the frequency gap parameter m 1. 

= 2 iJ2 P^k+mAJdjPkf) + P < k—m(P < k—r n A^djPi c (p S ) + P> k _ m {P [k _ mMm) A j djP^) 

k 


The first two terms are estimated with a favorable 2 cm constant using (17.311 . and thus placed 
in Airmail- ^ remains to consider the last term. This is essentially diagonal in k 1 so we 
can freeze the three frequencies in the allowed range. 

Now we consider the size of I. As in the proof of Proposition 14.11 there is one easy case, 
namely when |/| < 2~ k+m . Dispensing with that, from here on we assume that |/| > 2~ k+m . 
The remaining argument uses modulation localizations. To allow for that we extend both A 
and (j) outside / as free waves. Then we decompose the last term above as 


P>k—m(yP[k—m,k+m)A^ 9jP k (j)') Q <k—CmP>k—m ( Q <k—Cm-P[k—m,k+m)A 2 9jQ < k —Cm.Pkf > ') 

I \ 4 m, main ± 

+ Ai A ] large P k f> 


In the first term, we gain 2 cm by (17.411 . Hence this part can be put into The 

remaining part AAf'^rge contains only terms where all three frequencies are balanced, and at 
least one modulation is large. But this is estimated exactly as in the proof of Proposition 14.11 

(i) If the output has high modulation, then we bound it in L 2 using using divisible non-sharp 
Strichartz norms to gain either the divisible bound (I4.25p . or smallness via ^-energy 
dispersion as in (14.26(1 . 

(ii) If the second input (i.e., <f>) has high modulation, then we combine the L 2 L°° bound for 
A with the L 2 bound for □</>. 

(iii) If the first input (i.e., A) has high modulation, then we gain both divisibility and 
smallness via energy dispersion by combining an L$L 2 bound for OA and an L^L°° 
bound for <f>. 


We now consider the remaining terms in Ai™. For the terms P>k-mAod t (j)k and dtA^cf we 
estimate 

ll-Pfc ^ P>k-mA 0 d t (pk2\\L 1 L 2 {i] ^ 2~ 5max ^ k - k A\\p ki A 0 \\ L2 ^3^\\\D\- 1 dt4>k 2 \\L 2 L s [i] 

k2 
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respectively 


\\p k (p kl d t A 0 <j> k2 )\\ L i L2[I] < 2- 5max f fc - fc ^||p fcl a t A o || i2 ^i [/] ||0 fc2 || L 2 L 8[ /] . 

Finally, the term A a A a (j) is estimated in L l L 2 with off-diagonal gain using only divisible 
non-endpoint Strichartz estimates, which suffices. □ 

8. Multilinear null form estimates 

In this section we discuss directly the bounds for the operator Diff^, and prove Proposi¬ 
tion [T71 The bounds (j4.28j) and (I4.29P were already proved in [13]. The delicate matter is 
to be able to estimate the bulk of Diff]^’ 2 (0, 0)0 in terms of the divisible norm DS 1 of 0. 
We split our argument into two steps: 

(i) First we review the decompositions and the estimates in [13] on the full real line, leading 
to the proof of (I4.28p and (I4.29p . But we do this in a careful fashion so that we can 
isolate a bulk part where we get smallness from the frequency gap, and a remaining 
part where this does not work. For this remaining part we can easily produce a divisible 
bound. Unfortunately, this last argument uses modulation localizations. 

(ii) Secondly, we consider the changes in the previous arguments when the analysis is done 
on a compact interval I. The challenge here is to be able to accurately estimate the 
large but divisible part using only information localized to our interval. 

8.1. A review of jT3|. We decompose Diff^ into 

Diff™ = K *DiffX + (I - U*) Diff£ 

where the operator H*, introduced in [IS], selects the case where A has high modulation 
while both the input and the output have small modulation, 

n* Diff™0= ^ a Q<j-2 d M 

ko<k—m j<ko 

The better part {I — 1-1*) Diff^ can be still dealt with in a bilinear fashion using the following 
result: 


Proposition 8.1. We have the bilinear estimate 



||(/-?T)DiffM J v < (114*11/1* + ||VA o || £1l2h i)||0|Ui . 

(8.1) 

Further, 

we have the low modulation improvement 



110 - < 2-“'(||A,|| PS , + ||V4,|| (li2ft , ), IIVIls. 

(8.2) 

where 




k 0 


The bound (18.ip is the sum of the bounds (54) and (58) in [15] . The bound (18.2p is 
a corollary of the proof of (54) [13]; it follows from a similar improvement in the bound 
(132) [13], which in turn is tied to the fact that the estimate (143) [13] is only used in the case 
when ji < k? — m. 

We now turn our attention to the term "H*Diff^, where it is no longer enough to obtain 
bounds depending on the above norms of A. Our first tool here is the intermediate norm Z , 
which has the following properties: 
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Proposition 8.2. We have the bilinear estimates: 



\\n*Bis^j\\ N < HArlUII^IUi 

(8.3) 

respectively 

II^Diff^lU < \\Ao\\ A - hahz+LlL jm S i 

as well as the low modulation improvement 

(8.4) 


||^Diff^|| Ar <2- cm ||A,|| z ||^|| 51 

(8.5) 

respectively 

||^Diff- r ^lk < ■ 

(8.6) 


These are the bounds (133) and (140) in [T3] , where the low modulation improvement is 
again apparent from the proofs. 

Combining the estimates (18. ID , (18.31) , (I8.4j) and (13. 6 p we can eliminate the modulation 
localizations and obtain 


Corollary 8.3. The following estimate holds: 

UDiff^'V’ll jv ^ (ll^a-11^5 1 + IPArll^zyL 2 + l|VA)||^ li2i ji + ||^o||l 1 l°°)||V’||s 1 (8-7) 

Using this estimate, we can dispense with the cubic contributions due to A 3 = (Ajj, A 3 ). 
Indeed, combined with the bounds (14. 5 p . (14. lip and (I4.16p . as well as (13.8p to control 
||A 3 || £ 1 S 1 , we can use (18.7p to establish (I4.29P . We remark that the frequency envelope 
bound in (14.29ft is clear from the frequency gap m between the two inputs A 3 and <f>. 

The output of the quadratic part of A cannot be all dealt with using the Z norm, but a 
good portion of it is amenable to this strategy. This is described using the operators FLk 0 
defined by 

Td-ko A"(^fei, 0fc 2 ) ^ ( QjPkpA-'(Q<j<pk l , Q<j(pk 2 ) 

j<k o 

Precisely, the portion of A 2 which does not have good Z bounds is 

n m A 2 = ]T n k 0 A 2 (0 fcl > fc2 ) 

ko<ki—m 

A key result in [[T3J is to treat the output of this part in a genuine trilinear fashion, taking 
advantage of a cancellation between the A 0 and A x parts, which have otherwise been treated 
separately. Precisely, we have 

Proposition 8.4. For any admissible frequency envelopes c, d, e, we have 

< 2-HWklMU.IMk,, (8.8) 

where f(k) is as in (I4.30p . 

For this we refer the reader to the estimate (60) in [T3j and its dyadic versions (136)-(138), 
where the frequency envelope bound and the gain with respect to m are apparent. 

Hence it remains to bound 

||(/-H m )A 2 (0,0)|U +LlL oo, ||(/-H m )A 2 (0,0)|| A _i a i z+ilLoo . 

Considering the dyadic portions 

Pk 0 A (0fc L , 4>kf)i 
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the case of high-high interactions was also discussed in [13]. Precisely, from the bounds (134) 
and (141) in J3j we have 


Proposition 8.5. For k 0 < k\ — C, we have the dyadic bound 

II (/ - +1| (/ - w m )n o AS(fc,0 b 

<2- < l‘”-‘‘l||^|| S i||^ fe || s i . 


1 1 


(8.9) 


This suggests that we should decompose A 2 into a high x high —> low portion and a 
better reminder. We will be more accurate and set 

A 2 = ^ P ko A~(cj) kl , (j) k2 ) + ^ P ko A~((j) kl , 4> k2 ) + Pk 0 A 2 ((j) kl , 0 fc2 ) 

ki>ko~\-m ko—m<k\, 2 <ko-\-m k rn i n <kQ—m 


_ j^2,hh(m) _j_ ^2 ,med(m) _|_ ^2 ,hl(m) 


where k min = minj/c!, k 2 }- Note that no modulation localizations are present here. 

We first handle the part A 2, hh(m) ' R eca u f rom the proofs of (14.41) . (I4.10p . (14.161) and (14.171) 
that there is a bound with an off-diagonal decay of the form 

llUXOfe.^Jlls- + l|i%,Aj(^,^ fa )|| iIAi < 2- < <l*»-*‘l+l‘«-‘il)||0 li || s ,||^|| SI (8,10) 

when ko < k\ — C. 

Combining the bounds 08.8}) . (I8.9j) with (I8.3j) . (I8.4j) for the Ft* portion and (I8.10j) with 
(18. ip for the / — Ft* portion, we obtain the following: 


Corollary 8.6. For any admissible frequency envelopes c, d, e, we have 

IlDiff^Mm) (^,02)^11 N f < 2- cm ||0 1 |Ui||0 2 || 5 i||^||si (8.11) 

where f{k) is as in (I4.30p . 

Again, no modulation localizations are present here. 

The remaining parts of A 2 have no contributions from FL m A 2 , so we will estimate them 
entirely using the Z norm or the simpler L 1 L°° bound. The latter suffices in the case of A 0 , 
whose dyadic pieces are readily bounded by 


H-PfcoA-o(</>fci, </>fc 2 )IU 1 L°° ~ 2 2/u '° ||P fco AA5(</) fcl , 0 fc2 )|| L i L oo 

< 2-2(1,PJ IKpi-fy^ \D\-id t 4, kl )\\uL~ (8 ' 12) 

3= 1,2 

where k max = max{fc 1; k 2 }. In both A ( 2,/, ^" t) and ; note that we have fc max < k 0 + m 

for (say) m > 3. Moreover, the above dyadic bound sums up easily due to the off-diagonal 
decay. Using the embedding ZTL 2 C N, we easily obtain 


Corollary 8.7. For any admissible frequency envelopes c, d , e, we have 
||Diff^ 2 ,w (m) (0i, 0 2 )V’||iv / + ||Diff™2,med (m) (0i, 0 2 )'0||at / < 2 C n\M DSi \\U DS ijmsi (8-!3) 

A 0 a 

where f{k) is as in (I4.30|) . 

We now consider the contributions of A 2,h/(m ^ and js L f med(m) Our hrst tool is due to the 
estimates (134) and (135) in [13], which give 
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Proposition 8.8. The following estimate holds: 

WPkoA-K^Mh < 2 c ^ ko ~ kmax 12 _5 l fcl_fe2 111 <j) kl 11 5 i 11 (j) k2 11 s i (8.14) 

This gives a gain for the high-low portion of A x . Hence in combination with (18.31) , (18.4p . 
(18.9(1 for the Ti* portion and (18.11) . (18.101) for the / — TL* portion, we obtain a result with no 
modulation localizations: 


Corollary 8.9. For any admissible frequency envelopes c, d, e, we have 

||Diff^ 2 ,w ( m ) (0 1 , 0 2 )'0||iV/ < 2- cm ||0 1 || s i||0 2 || 5 i||^|| s i (8.15) 

where f(k ) is as in (14.301) . 

Finally, it remains to consider the contribution of ^> med ( m \ There the estimate ((8. 14[) 
suffices for the bound (I4.29p . but provides no divisible norm estimate. To summarize, we are 
left with the case 

k 0 — m < k\, k- 2 < k 0 + m. 

Here we can take advantage of the low modulation decay in (18.2p and (18.5p to obtain 


Corollary 8.10. The following bound holds for large enough C and k 0 , k\, k 2 as above: 


where 


l|Diff™ 2,me d (m),low^ k ^ ^ 2 ^ \ \ (j) kl \ \ £l \ \ 0 fc 2 \ \ £l \ \ lf k \ \ 5 I 


\ 2 ,med(m) ,low \ PA 2 ,med(m) 

A x — / J '^C<ko-mJrk 0 - t ^- x 

k 0 


(8.16) 


Thus we can restrict ourselves to high modulations in A x , i.e., 


A*~**»^(*,*) = X 

ko 

For this part, we can use the L l L°° norm. Precisely, each dyadic piece obeys the estimate 
\\PkoQ>ko-Cm-A-l((j)k 1 , 0fc 2 )||LiL°° <2~ 2fc ° +Cm || DA l((f kl , k 2 ) \\ L 1 L°° 

<2 Cm |||Zl| 2 0 A . i || i2Loo |||T)| 2(j)k 2 \\ L 2 L oo . 


Recall that we are in the scenario ko — m < ki, k 2 < ko + m. Combined with the embeddings 
L l L 2 C N and VS 1 C L°°L 2 , we obtain 


11 Diff ™2,med(m), high ^ I \ N f ~ 11 ^ 11 DSl 11 011 DS\ 11 V’fe 11 S\ 


where c, d, e are any admissible frequency envelopes and / is as in (14.301) . Thus the proof of 
Proposition 14.71 is concluded on the entire real line. 


8.2. Interval localized bounds. Here we seek to prove the result of Proposition 14.71 in 
a time interval I. Due to the paradifferential nature of the operator Diff^, we can fix the 
frequency 2 k of the input if and simply estimate the expression Diff ffifk- For A we consider 
its components successively: 

a) The cubic terms A 3 . Here we simply extend A 3 outside / as a homogeneous wave, and 
then use the bound (18.71) . By Propositions 14.11 14.41 and 14.51 we know that A 3 is entirely 
estimated by divisible norms. 
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b) The contributions of A_ 2 ’ hh ( m ) anc i p^ hl<dm \ Here we extend (j) outside / as a homogeneous 
wave, and then apply (18. lip , respectively (18. 15[) . 

c) The contributions of A 2 ’ med ( m '> anc i These are estimated directly via (18. 13[) : no 

extensions are necessary. 

d) The contribution of jsJ ^ nud ' rn> . This is the part where the divisible bound is more 
difficult to gain. In what follows, we simply write A^, = A^. To review, we have to estimate 
the expression 

||P fc0 A'(^i, 0fc 2 II N 

where the frequency balance is 

k 0 < k — m, k 0 — m < k\, k 2 < k 0 + m . 

This is where the length of the time interval / plays a role. Comparing it to ko, we distinguish 
two scenarios: 

(i) Short time intervals, |/| < 2~ feo+m . Then we have a direct estimate, 

|| -Pfco A {(pki i ^k^^i^k || N[I] || Pko A ||l 1 L 2 [7] 

~l^l 2 II Pko Aa,(0fci j 0fc 2 ) || L 2 L°°[I] || ^ x^Pk 11 L°°L 2 [I] 

< 2 Cm \\ \D\ iP ko A x ((p kl , 0fc 2 )||L 2 L°°[/]|| • 

Summing over k 0 ,ki, k 2 and recalling the definition of the C S ko , we obtain 

11 Diff^2,med (m) ( ^ 1 ^ 2) V’fc 11 JV[i] < 2 Cm || VF <fc _ m A« m )(0 1 , MMlMsm, 

The right hand side can be controlled by Proposition 14.11 The splitting into small and large 
parts is then achievec0 by using the corresponding statements (14.6ft - (14 .7ft for A 2 . 

(ii) Long time intervals, |/| > 2~ ko+m . This is the difficult case. Our proof here in¬ 
volves modulation localizations, so we need to consider appropriate extensions of A x and 
ijj k . Since kp k is an independent variable, for it we can simply use the canonical extension as 
homogeneous waves. For A x , instead, we extend its arguments </>i and (j) 2 as homogeneous 
waves. 

The bound (18.161) suffices for low modulations of A 2,med ^ h \ therefore it suffices to estimate 

^ ' ||(3>fco— CmPko A a ;(0fc 1 , 0fc 2 ) ^ x'fik 11 JV[/] ^ 

ko<k—m 

\\Q>ko —CmPko A x (0fci, 0fc 2 )||L 1 L°°[/]|| ^ x^kW^L 2 

ko<k—m 

To estimate the localized L l L°°[I] norm we write 

Q>ko—CmPko A x {(f) kl: 0fc 2 ) Q>ko~ CmPko I—I ^ ^ ^®0fc 2 

= 2- k0+2Cm L (<j )kl , ( p k2 ) 


10 Technically, (14.611 - 114.71) apply to the full operator A x . Nevertheless, A x — A x med ^ gains 2 _cm by 
(18.101) . and thus this difference can be put into the ‘small’ part. 
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where L is a bilinear translation invariant form whose kernel is localized near 0 on the 2 ko 
scale in space-time. This allows us to estimate the tails outside / as follows: 

||Q>fco-Cm-Pfc o Ax(0fci, 0fe 2 )IU 1 i oo [/] ~ 2 Au 2 Cm ||x5 O 0fci|U 2 L°°||X/ U 0fc2lk 2 L 00 - 

Since k\ and k- 2 are close to k, we conclude using Proposition 13.41 that 

ll<2>fco —CmPko A x .(0fc l; 0fc 2 )||L 1 L°°[/] 1$ ^ Cm \\ ( l ) k 1 \\DS 1 [I]\\ ( t > k 2 \\DS 1 [I] 
which is the sought after divisible bound. The proof of Proposition 14.71 is concluded. 


9. The paradifferential parametrix 


The goal of this section is to prove Theorem 14.91 Instead of producing an exact solution 
operator, it is easier to produce parametrix with small errors. Then the exact solution is 
obtained in a straightforward iterative fashion. The result we produce here is as follows: 


Theorem 9.1. Let A x be a Coulomb magnetic potential solving the free wave equation with 
energy E, and letm > 5. Consider any finite energy initial data (0 O , 0i) localized in frequency 
~ 1, and a source f G N which is localized in frequency ~ 1 and modulation < 1. Then 
there exists an approximate solution 0 so that 

II0||so II(0o,0i)||l 2 xl 2 + 11/11%) , . 

H0[O] — (00) 0i)||l 2 xL 2 + IPa” 1 / — /!!% 2 Cm (|| (0o, 01) || L 2 xL 2 + ||/||%)- 


We remark that the frequency support of the approximate solution 0 is only slightly larger 
compared to 0o, 0i and /; it is essentially also localized at frequency ~ 1 and modulation 
< 1. After choosing m sufficiently large, Theorem 19.11 directly implies Theorem 14.91 see [T3l 
Proof of Theorem 6.3]. 

The definition of our parametrix is identical to the one used for the small data problem 
in [42], which was based on [19]. The main difference is in the source of smallness for the 
errors. In [13] this comes from the smallness of the energy of A. Here, we rely instead on 
the frequency gap m, which must be large in terms of the energy E. 

The parametrix is constructed using pseudodifferential operators with rough symbols. 
Given a symbol a(t, x, r, £), its left- and right-quantizations are denoted a(t,x,D) and 
a(D J y J s), respectively. We also use the standard convention Dy = 

To prove the theorem it suffices to consider initial data 0[O] and source / with frequency 
localization in fa 1, and construct the approximate solution 0 with a similar localization. 
Thus we work with the unit-frequency localized paradifferential magnetic wave operator 

□T = □ + 2 tA{_, n d r (9.2) 


where A solves the free wave equation OA = 0 with initial data H[0] G H x x L 2 X . 

Given an additional small angular localization parameter 0 < a < 1/2, we construct a 
parametrix for (j9.2|) as follows. For £ G M 4 we define 


e 


A . — A 


. V7 


W . 












Define the angular sector projection n> e by the formula 

•Un> e /](?) := (1 - (l - <?(AAA))/(f). 

It is important to note that if / is real, then so is 11> 0 /. We also define 

n£ := n". - n“ e , := 1 - n^. 

-"2 — 

For each £ < 0, we define yj £y ± to be 

^r,±(£, x, 0 := ±L^A~iU^ 2rre (uj ■ P £ A ) (9.3) 

The full phase ip± is then defined to be 

•0±(M,£) = := ( 9 - 4 ) 

i<—m. 

Note that we have 

L%$ ± = ± 53 • P t A) (9.5) 

l<—m 

In other words, ip± represent roughly the output of the integration of the (bulk of the) 
magnetic potential A along light rays. Here we exclude the output of small angle interactions, 
which is on one hand perturbative, and on the other hand would yield a bad dependence of 
i[)± on £. This is akin to symbol smoothing for rough pdo’s. 

We use the pseudodifferential gauge transform 

e <o £ ' ± (^, x , D) : = (5 , <0 e _ ^’ ± )(f, x, D) 

where S^o is taken with respect to the (t, x ) variables of the symbol. Its dual is 

e<c f(D,y,s) 

As the symbol is independent of the time Fourier variable £o = t, we see that the left and 
right quantizations with respect to t are the same, i.e., 

e <o (A Vi s ) = e <o (t, D, y ). 

The operators ef?Q ± (t, x, D), respectively e^f (D, y, s ) are used on the left and on the right 
in order to conjugate the paradifferential operator □ p fi n to the d’Alembertian □. Precisely, 
our parametrix is given by 

(j)(t,x) = e^’ ± (t,x,D)\D\~ 1 e ±ltlDl e^(D,y,0)(\D\(f) O ±(j) 1 ) 

+ \ So e ^ ± ^ t,x,D ^ D ^ le±l{t ~ Sme <^^ D,y,s ^ ± ^ s ^ ds ^ 

To show that the above parametrix satisfies the bounds in Theorem 19.11 we need the 
following mapping properties for the operators eff^ft, x, D), respectively (D, y, s): 

Theorem 9.2. For m > 0, let be defined as in 119.4)1 . Then the following mapping 
properties hold with Z e {N 0 , L 2 , N^}, with implicit constants which depend on the energy 
E of A: 

(1) (Boundedness) 

e<£*{t,x,D)\ Z^Z 
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(9.7) 






(2) (Dispersive estimates) 

e^'(t,x,D): Sl^S 0 (9.8) 

(3) (Derivative bounds) 

(VeJS^XM) : 7 -G 2~ m Z (9.9) 

(4) (Approximate unitarity of e 1 ^ on L x ) For each t G M. we have 

e~^(t,x,D)e‘**(t,D,y) - I : L\ -> 2^-^Ll (9.10) 

(5) (Approximate unitarity of e l ^ ± on N) We have 

e^ ,± (t,x,D) ( ^(D,y,s) - I : N 0 -G 2 ~^ m N 0 (9.11) 

(6) (Parametrix error estimate) We have 

e^’ ± (t,x,D)D - n p A e^(t,x, D) : S{ ± -G 2~^ m N 0 , ± (9.12) 


Remark 9.3. The small constants a, 80 , 81,82 and 8 are now different from those used in the 
earlier part of the paper. They are chosen in the following logical order: a, h 0 , hi, 8 2 . On 
the other hand, we reserve the symbol 8 > 0 for a free small number, whose value may vary 
depending on the usage. 

This result mirrors Theorem 3 in [13] , with the key difference that the smallness is now due 
to the frequency gap parameter m. Assuming these bounds, the conclusion of Theorem 19.11 
follows in the same way as in [13]. 

To prove the above theorem, we may directly borrow the estimates from [T3] which do not 
involve smallness, namely (19. 7[) and (19.81) . The implicit constant in these estimates will now 
depend on the energy E of A. The remainder of the section is devoted to the proof of the 
new bounds (19. 9[) . (I9T01) . (19. lip and (19. 121) . 


9.1. Review of decomposability calculus. Here we give a brief review of the notion of 
decomposable symbols developed in HU 021 133], which provides a convenient way to keep 
track of mixed L^L^-type bounds. The particular version we use is from [TU 13] . 

Given 6 G 2 Z ~, where Z_ denotes the set of nonpositive integers, consider a covering of the 
unit sphere § 3 = {£ : |£| = 1} C R 4 by solid angular caps of the form {£ G § 3 : \(f — ||i| < 0} 
with uniformly finite overlaps. We enumerate these caps by the centers <f G § 3 , and denote 
by {6j(O}0 the associated smooth partition of unity on § 3 . 

Consider a smooth symbol c(t, x; £) which is homogeneous of degree zero in £, i.e., depends 
only on the angular variable c 0 := j|y. We say that c(t,x;£) is decomposable in L q L x (where 

1 < g, r < 00 ) if there exists an expansion c = X^e 2 z - c ^ suc h that 

\\c {e) \\D e L?Lr < OO, (9.13) 

d£ 2 Z - 


where 


l cW ll D g L*L* 
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k =0 <t> u 


I L*- 


(9.14) 


We denote the class of such symbols by DL q t L r . For c G DL q t L r x , we define the norm ||c||£> L 9 L r 

by taking the infimum of (19.131) over all possible decompositions c = ^ eg2 z - . 
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The class DLfL r x provides a convenient framework for establishing L^L^-type estimates for 
pseudo-differential operators arising from products of symbols. The following lemma collects 
the key properties that we need. 


Lemma 9.4. The following statements concerning the class DL q t L r x hold. 

(1) For any symbols c G DL qi L ri and c G DL q2 L r2 , its product obeys the Holder-type bound 


°d\\DL^L% ffj IMIIdL® 2 L? 


where 1 < c/i, q 2 , Q , r 1 , r 2 , r < 00 , — + — = - and - + — = 

— — 1 qi q 2 q r 1 1 7*2 r 

(2) Let aft, x; £) be a smooth symbol whose left quantization aft , x\ D ) satisfies the fixed time 
bound 


sup \\a(t, x] D)\\ L 2^ L 2 < A. 

t 

Then for any symbol c G DL q L x , we have the space-time bounds 


\\{ac)ft,x-D)\\ L <n Ll ^ L <n L7 < A\\c\\ DL <i L r 

where 1 < qi, q 2 , q, r 2 , r < 00 , -A + ~ ^ and \ + \ = qq- An analogous statement holds 

in the case of right-quantization. 


For a proof, see [T2J Chapter 10] and [13] Lemma 7.1]. 

We borrow another lemma from [T3], which relates the product of quantized operators with 
the product of the corresponding symbols within the framework of decomposable symbols. 


Lemma 9.5. Let a(t, x;£), bft,x;£) be smooth symbols, where we assume furthermore that 
a is homogeneous of degree zero in f. Then we have 

II a{t,x;D)b{t,x\D) - (ab)(t, x; D)H L w L 2_> L * L 2 

<|| (dfa)(t, x; D)\\ DL ^ L ^ || {d x b)(t, x\ D)\\ L ^ L 2 ^ L n Ll 

where - = — + An analoqous statement holds in the case of right-quantization. 

q qi q2 ^ J d 'l 

For a proof, see [13] Lemma 7.2], 


9.2. Symbol bounds for if. We first consider the size and regularity of the dyadic pieces 
of ipk,z fc> namely 

:= (n gll>k,±){t,X,£). 

Given the symbol dependence on the angle, it is useful to keep in mind that the size of 
'ifk, ±(M,0 is roughly given by 

^~2- fe r 2 n f{u-A k ). 

We borrow the following decomposability estimates for the symbol from [ T3] : 


Lemma 9.6 (Decomposability estimates [13] Section 7.3]). For ^ | we have 

ll(<i.2“‘v^)ll DL1L.S < 2-<i 

Moreover, for any f3 > 0, we have 


(9.15) 






E. 
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(9.16) 


In particular, for q > 4, 


ll^'WMI dlil- < (9.17) 

Remark 9.7. From the decomposability bound (19.1711 with q = oo, (19.91) follows easily. 

We also collect here additional symbol bounds which are cruder but useful for estimating 
oscillatory kernels: 

Lemma 9.8 (Symbol bound for [13] Section 7.3]). The following symbol bounds hold. 

(1) For any a, (3 > 0 and 2 < q < oo we have 

||9r 1 V9|V>£lli<i- ! < -f>E. (9.18) 

When a — 0, we interpret the expression on the left hand side as 

(2) For q > 4 and 1 < {3 < cr _1 (l — 1), we have 

< 2 ( -i + “ )fc 2(9.19) 

(3) For 1 < /3 < a' 1 , we have 

-il)±(t,y,£))\ < {x-y) a{ P-^E. (9.20) 

9.3. Fixed-time L 2 bounds. Here we prove (19.1011 . For later use, we prove the following 
stronger result: 


Proposition 9.9. For sufficiently small a > 0, there exists do > 0 such that the following is 
true: For every £, k < 0 with i + C < k, we have 


3 -#<a±, 

'<k 


(t,x,D)e‘*<‘- ± (D,y,t) - l)Po||Lg_i| <e + 2 10 ‘'-‘>. 


(9.21) 


where the constant is independent of k , l. 


We remind the reader that e<f(D,y,s) = e^f (t, D, y), since the symbol is independent 
of r = £o- I 11 particular, this pseudodifferential operator makes sense on every fixed time 
slice. Note that (19.1011 follows by taking k — 0 and noting that if)± = ^<_ m . 

To begin the proof of Proposition 19.91 we prove a closely related estimate which does not 
involve space-time Littlewood-Paley projections for e l1 f <£ ’ ± . 


Lemma 9.10. Let i < 0 and a(D ) be a multiplier such that a(£) is a smooth bump function 
adapted to {|£| < 1}. Then we have 

|| e -#<€,±(£, £, D)a(D)e i ^(D,y,t) - a(L>)|| Lg _^ < £ 2< 1 -*>>*. (9.22) 

Furthermore, for any k e M we have 

\\e- i ^(t,x,D)P 0 \\ L ^ L 2 < E 1, (9.23) 

\\ e 2^ ± (fx,D)P 0 \\L^L> <e 1. (9.24) 


Proof. We first reduce (19.2311 and (19.2411 to proving (19.2211 . By a TT* argument, ()9.23j) is 
equivalent to Lf boundedness of e _ ^ < ^ ± (t, x , D)Pf;e l ' l)<t ' ± (D 1 y, s), which follows from (19.2211 
and the L 2 X boundedness of a(D) = Pf. Next, note that 

J e- # <*’±((t, x) - 2 :, tf)2 bk m(2 k z) d 1+A z 
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where m(z) is the kernel for S < o- As the hypotheses for (19.23P is obviously invariant under 
translations, the left- (and also right-) quantization of each x) — z,g) obeys the 

same bound as (19.231) . Therefore, by the rapid decay of m(-), (19.24)1 follows. 

The proof of (I9.22p is an easy consequence of non-stationary phase, tha nk s to the fact 
that cr > 0 can be taken arbitrarily small. The kernel of the operator in (I9.22p is given by 


K±{t,x,y) := C J - 1 )a(f)e iHx ~ y) d 4 f 
=C j j\±e i ^(t,x,t,y,0a(0e iHx - y) dpd 4 £ 

where 

T ± (t, x, s, y, £) := x, £) - ^ «,±(s, y, £)• 

We divide into two cases, namely when \x — y\ < 2~ S£ and \x — y\ >2~ se . 

Case 1: \x — y\ < 2~ Si . In this case, using (19.181) with q = oo and |a| = 1 and j3 
each frequency t’ and summing up in t' < i, we obtain 


Since is real-valued and suppa C {|£| < 1}, it easily follows that 


K 1 (t,x,y)\<2^E for \x - y\ < 2~ Si . 


= 0 for 


(9.25) 


Case 2: \x — y\ >2 5e . Here we integrate by parts in £ for TV-times and use the bound 
(19.201) . Then we obtain 

\Ki(t,x,y)\ S,iv,E - - f or \x - y\ >2~ se . (9.26) 

\x — y ^ 2 ® 

Combining Cases 1 and 2, it follows that 

sup f \Ki(t,x,y)\d 4 y + sup j \K\(t, x,y)\ d 4 x < E 2 (1_<So)£ 

x J y J 

if cr, 6 are small enough and N is sufficiently large. Estimate (19.221) now follows. □ 

Next, we borrow a lemma from [13], which is useful for handling e k <l:± when k > L 


Lemma 9.11. For £+ C < k and every i E M, we have 

\\e- k i ^ ± (t,x,D)P 0 \\ L ^ L 2 < E 2 10 ^~ k \ 
Furthermore, for 1 < q < p < oo and i + C < k, we have 
\\e^ <e ' ± {t,x,D)P 0 \\ L p L 2^ L ', L 2 < E 
These estimates also hold for e k l ^ <e,± (D,y, s). 


(9.27) 

(9.28) 


Remark 9.12. The specific factor 10 in the gain 2 10( ^~ fe ) is irrelevant, but it is important to 
note that this number is much bigger than 1. This will be very useful in our proof of (19. lip , 
where we will use this factor to dominate smaller factors. In fact, a variant of the proof below 
allows us to make this gain as large as we want, by making the implicit constant larger. 
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7 ( 1 ) 


, S[ 6 \ S ( ,!\ ..., Sf, which obey the same 


Proof. Consider frequency projections 5^. , 
bounds as S*, and furthermore satisfy 

SP:=S„, S® = 2- 2 ‘5f (fl? + A), gf = sf S< i+1) 
for i = 1,..., 5. Thanks to the assumption l + C < k, we may write at the level of symbols 

=2 ~ 2k s£\d 2 t + A) e -^-± 

—2~ 2k S^\—2iAif <et± - |V^,±| 2 )4 2) e- # <^ 

5 


= • • • = 2 


-10fc 


n [s t fa) (-2iAV>«,± - |VV><,,±| 2 )' 


i=i 


= -*h<£,± 


Here we used the fact that a:, £) solves the free wave equation 1 = for 

each £, since A does. Disposing of the nested projections by translation invariance, using 
the decomposability bound (19.17(1 and L 2 X boundedness of (£, x, D)P 0 , the desired 

estimate follows. □ 

We are now ready to prove Proposition 19.91 

Proof of Proposition [Ql Thanks to the frequency localization of the symbol e<jf e,± (s, y, £), 
note that we can harmlessly put in a multipier a(D) whose symbol is a smooth bump function 
adapted to {|£| < 1}. The operator in (19.211) therefore equals 

(e^’ <t ’ ± (t,x,D)a{D)e^ t ’ ± {D,y,s) - a{D))P 0 . 

For the purpose of proving (19.21 jl . we can safely dispose Po on the right. Next, note that 

e-^.± (t, x, D)a(D)e(D, y, s ) - (t, x, D)a(D)e^< £ ’ ± (D, y, s ) 


= e 


>k 


(t, x , D)a(D)e i ^(D, y, s ) + (t, x, D)a(D)e^ ± (D, y, s ). 




By Lemma [9.111 and (19.231) . the operators on the right hand side obey 

\\e^ ± (t,x,D)a(D)e i *<‘.*(D,y,s)\\^Ll<Er^- k \ 

Combining these bounds with (19.221) established in Lemma [9.101 (19.21)1 follows. □ 

9.4. Space-time bounds. Here we establish (19.111) . More precisely, we will show that: 

Proposition 9.13. For a > 0 sufficiently small, there exists hi > 0 such that the following 
holds: For j < C, we have 

|| Q J [e^ ± (t,x,D)e i ^ ± (D 1 y,s) - l]P 0 Q<o\\ N ^ x o J/2 < E 2~ s ' m . (9.29) 

The estimate (19.291) proves the part of (19.111) . Note that the LfL 2 x portion of (19.111) 

follows immediately from (19.10p . 

To ease the notation, we omit writing ± in ip±. Also, we omit the dependence of the 
constants on E. It will be convenient to define the compound symbols 

T(t, x , s, y, £) :=il>(t, x, £) - y, f), 

V<£(t, x , s, y, 0 X , 0 - y, £)■ 
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The symbol dp is defined in the obvious way. 

Given a compound symbol a(t,x,s,y , £), we define the double space-time frequency pro¬ 
jection 

(tj Xi Vi 0 • ^<k^<k ^(^> U Vi 0 1 

where 5'^ is the space-time frequency projection applied to (t,x), etc. Therefore, according 
to our conventions, 

(*, x, D, y, s ) = e~f (t, x, D)e^ k (D, y, s ). 

We begin with a lemma for frequency localizing the gauge transform e _i4 ' <£ , which will be 
used several times in our argument. 


Lemma 9.14. For 2 < q < oo and i + C < k < 0, we have 

ll( e «? <£ - <Ql ‘OalUfLl^L-Ll < 2 ( 5-5)'2 1 "l'- t >. (9.30) 

Proof. Proceeding as in the last part of the proof of Proposition 19.91 we may write 


— 

5 <C 


<f 


— 

'<fc 




)^0 


= [e<c’ <f (t, a;, D)a(D)e l ^ e (D, y, s) - e^ <l {t, x, D)a(D)e l ^ e (D, y, s)]P 0 

= U D M D )e<c*( D , y, s ) + x, D)a(D)e^f <c (D, y, s)]P 0 

where a(£) is a smooth bump function adapted to {|£| < 1}. Then (I9.30jl follows from 
Lemma 19.111 □ 

We are now ready to prove Proposition 19.131 

Proof of Proposition \9.13[ We proceed in several steps. Let 6 > 0 be a small number to be 
determined later. 


Step 1: High modulation input. For j' > j — C, we claim that 


\\QA e <lo (t,x,D,y,s) - i]PoQj’\ 


n*^x: 


0,1/2 




(9.31) 


Using the portion of A)), (19.3111 follows from 


II Qii<! - mQfWLl^Ll, < 2-0-*)”*. 

Since Qj,Qj' are easily disposable, this estimate follows easily from (19,10)1 . 

Step 2: Low modulation input, —\m < j < C. In this step, we take care of the easy 
case —\m < j < C. Under this assumption, we claim that 


IIQ?[ e <o (t,x,D,y,s) - 1 }P 0 Q <j -c\ 


N*^-X, 


0,1/2 


2~ Sim 


(9.32) 


Note that 


Qj[ e <<j-c ~~ l]-fb Q<j-c — 0 


by modulation localization. Using the portion of N* } . it suffices to prove 


11 Qj [ e «o - e «/-G] p oQ<j-c\ | L? 


•Ll^L 2 t 


t,X 




Since Qj , Q<j-c are disposable on Lf x and L^°L X , this estimate follows from (19. 30ft and the 
fact that T = T<_ m . 
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Step 3: Low modulation input, j < — |m, main decomposition. Henceforth, we 
consider the case j < — | m . The goal of Steps 3-6 is to establish 


\\Qj[ e <o (t,x,D,y,s) - e 


-niv,- 


<o 


<j—8m 


(t, x , D, y, s)]P 0 Q<i-dL_ x o,i / 2 < (9.33) 


At the level of symbols, we begin by writing 


e - e = - i 


dh e di - 


f£>j—8m 


V e Ve' dl'dl 



' £>£'>j—5m 

'HiVe'Se' e~ i9 <*" dl" dl'dl 


' £>£'>£"> j—5m 

=:C + Q + C. 


We treat £, Q and C in Steps 4, 5, and 6 , respectively. 

Step 4: Low modulation input, j < — |m, contribution of £. In this step, we prove 

\\Qj£<s:o(t, x ,D,y,s)P 0 Q < j-c\\ N *^ ) . x °^/2 ^ 2 5im . (9.34) 

We further decompose £ as follows. We first separate out the low frequency part of the 
gauge transform, then decompose according to the frequency of (depending on whether 
l is higher or comparable to j), and finally replace the gauge transform by 1 : 


£ = — i 

1 ( e ~ i9 <^ - e^7-T 5m ) M 

J €>j—5m 

(9.35) 

- i 

J £>j-\-105m 

(9.36) 

- i 

[ (e^-T Sm - 1 ) di 

J j—5m<i<j+ 10 5m 

(9.37) 

- i 

f ^ t dl 

J j—5m<£<j-\-105m 

(9.38) 


—: Ci + £ 2 + £3 + £4. 


We treat the contribution of £4,..., £4 separately. 


Step 4.1: Contribution of £ 4 . For £ 4 , the double frequency localization (£i)<o an d the 
fact that l < — m < 0 allow us to write (at the level of symbols) 




<Cj — 8m 


— e «j-c~ Sm ) di 


(£i)<o— (-*/ *Ue«c' ^ „ . . 

' J l>j—5m ‘ 

As the rest of the argument for £4 will be translation invariant, we can easily dispose the 
double frequency localization (-)<f. We are now reduced to proving 


liTr ( e 


-i’J' 


<Cj — 8m 


'«j-c 






for t > j — 8m. This estimate follows from the decomposability bound (19.17jl with q = 6 
and (19.30P with (p, q ) = (00, ~). 


Step 4.2: Contribution of £ 2 . Before we begin, note that the double frequency localiza¬ 
tion (-) <0 does nothing to £2, £3 and £4, thanks to their frequency localization properties. 
Therefore, we drop (-)<o from now on. 
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In this step, the desired gain in m will be obtained from £ > j + 10 8m, and we do not 
exploit the difference structure in d£. In fact, in order to apply decomposability bounds, we 
divide dq(£, x,s,y , £) = if)e(t, x, £) — ipe(s, y,£) and treat each term separately. Here we only 
consider the case ipi{t,x, £); the argument for the other case is analogous. 

Thanks to the frequency localization e^P^ 5m , the contribution of ^(£, x, £) in the inte¬ 
grand in 09.361) equals 

where PoQ<j~c is a slightly enlarged version of PqQcj-c- Then by the frequency localization 
of e^Pn Sm (now applied to the one on the left), the modulation of the output forces an 

2 -|h-i)+. 


'<j-c 


angular separation between the spatial frequency of x,£) and £ of the size 
Therefore, the preceding operator equals 

where we dropped PoQ<j-c as it is of no more use. Using fixed-time L 2 X boundedness of 
e l ^<i~ Sm p 0 an( ] the decomposability bound (19.151) summed over 9 > 2~^ ( ~ j ' )+ , it follows that 




^eJl<S~ Srn )(t, x, D)e i l < ic Sm (D, y , s)P 0 Q <3 . c \\l T li^ 


(9.39) 


for £ > j. Now integrating this bound over £ > j + 10 8m, we obtain a gain of 2 4 <5m from 
the factor 2^^~ e \ which is acceptable. 

Step 4.3: Contribution of £ 3 . For £ 3 , we make use of the difference structure in the phase 
v b<t-( 5 m, but not for the symbol d£. Thus we again only consider ^(£,x, £). We remind the 
reader that in this case, the ^-integral in (I9,37|> is taken over j — 8m < £ < j + 105. 

Proceeding as in Step 4.2, the contribution of this term in the integrand for (I9.37[) equals 


<j—5m 


C 3 (i) := Q,(n" 2 _^>(e— c 


- 1 ))(t,x,D,y,s)P 0 Q <j - C . 


Then proceeding as in the proof of (19.391) . but using (I9.21[) instead of mere boundedness 
of e <j- < eT' 5m , and integrating over j — 8m < £ < j + 105, we obtain 

|| [ £ 3 (0 d£\\ l T lI-+l\ x < 2-^2 1 i 5m (2 (1 - So)ij - Sm) + 2~ mm ). 

J j—5m<£<j-\-10Sm 

This bound is good if 8 > 0 is sufficiently small, since j < — | m. 

Step 4.4: Contribution of £4. As in the previous step, the ^-integral in (j9.38j) is taken 
over j — 8m < £ < j + 105. Here we make use of the difference structure of the symbol d£. 
The modulation localization properties again allow us to write 

Qj^e(t,x,D,y,s)P 0 Q <j - C = ~ ^(A 2 /, s))P 0 Q<j-c ■ 

9> 

As usual, we can harmlessly put in an operator a(D) which is a slightly enlarged version 
of Pq. We now claim that for 2 < q < 00 , the following bound holds: 

■,V)u rr. nu/m _ „mu( 9 )| 
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|| W(t,x,D)a(D) - a(D)^'(D,y,s)\\ LrL ,. L , Li < 2-5 f 2'r c 


(9.40) 
















To prove this bound, we compute the kernel of ijjf\t,x,D)a(D) — a(D)'ipf\D,y,s). 
K 2 (t,x,y) =C j (^f\t,x,C) ~'ipf\t,y,0) a (0 elHx ^ y) 

=C j J {x-y) ■ (d x il)f ) ){t,px+ (1 - p)y,i)a{i)e < {x ~ y) dpd^i 

=C [ [ d r [{d x il>\ 0) ){t,px + (1 - p)y,£)a(£)]e* <:c ~ y) dpd 4 £. 


Integrating by parts in £ several times and using the symbol bound (19. 1 8[) . we obtain a 
kernel bound which implies (19.401) . 

Applying (I9.40P with q — 2, it follows that 

II QA^Xx, D) - ^(D^^mQ^cWL-L^Ll, < 2-iVe- c . 

Summing over 6 > 2~s(^ _ 4)+ and integrating over j — 5m < i < j + 5m, we arrive at 
|| [ Q^ e (t,x,D,y,s)P 0 Q <j - C d£\\ LrL ^ LL < 2~^^2 CSm . 

J j—5m<£<j+105m 

This is good for sufficiently small 5 > 0, as j < —\m. This completes the proof of (I9.34|) . 

Step 5: Low modulation input, j < m, contribution of Q. Here we prove 


\\QjQ<o(t,x, D,y,s)P 0 Q < j-c\\ N *^ x oA /2 ^2 5im . 

(9.41) 

we begin by further decomposing Q: 


Q = - [[ _ e^~ Sm ) dZ’dt 

J J £>£'>j—5m 

(9.42) 

- [[ V e V e eZjf-di'dt 

£>j-\-105m 

(9.43) 

- [[ Klr Sm - 1) di’dl 

J J j—Sm<£'<£<j-\-10Sm 

(9.44) 

- [[ dVdl 

J J j—8m<£'<£<j-\-105m 

(9.45) 

—■ Qi + 0.2 + Q 3 + Qi- 



We treat each of these terms below. 

Step 5.1: Contribution of Q 1 . Proceeding as in Step 4.1, we have 

(Qi)«o = ( — 


~ e^.tT Srn ) dtdt) 


/ <o 


/ £>£'>j—5m 

and the outer (-) <0 can be disposed by translation invariance as before. Next, by (19.171) 
(with q = 6 for 4^, q = oo for 4v) and (19.3Q[) with (p, q) = (oo, 3), we have 

-i'EV 


-i'& < 

'£*n e «c 


|4b4>(e 


<£j-c IWL^Ll^L* 


< 2~^2~ < ' 10 ^2^ Sm 2~6^~ Sm ~^ 


Integrating over £ > £' > j — 5m, we see the desired gain of 2 ( 10 2 ^ m . 
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Step 5.2: Contribution of <2 2 - As in Steps 4.2, 4.3 and 4.4, (-)<o does nothing to 
Q 2 , Q 3 , Q 4 , and therefore can be removed. Also, in this step we split 4b(f, x, s, y, £) = 
£) — and handle only the contribution of ipe(t, x, £)?/y/(f, x, £), as the argu¬ 

ment for the other parts is the same. 

As in Step 4.2, the contribution of x, x , £) in the integrand in (j9.43j) equals 



We first split 



The first term is good, as we already see an angular separation. For the contribution of 
the second term, we can apply an argument similar to Step 4.2 to conclude that there is 
an angular separation between the spatial frequency of £) and £ of size ~ 2“ 2 : ( 

Therefore, the preceding operator equals 



= : Q2,l(40 + Q2,2(40 


For Q 2 ,\ we use -i/y G DL\L°£ and 11^ G DL^L™, and vice versa for Q 22 ; see 


(19.151) . (19.171) . We also use fixed-time L 2 X boundedness of e<P c Srn P 0 in both cases. Then 



which are good once integrated over {i > (! > j — 5m} D > j + 10 5m}. 

Step 5.3: Contribution of Q 3 . We again only consider ^y(i, x, £)^(t, x, £). Proceeding 
as in the previous step, the contribution of this term in the integrand in (19.441) equals 



We proceed as in Step 5.2, but replace the use of L 2 boundedness of el<ic m Po by d9 m- 
Integrating these bounds over j — 5m < d‘ ' < t < j + 105m, we obtain 


l 


j—Sm<£' <£<j-\-105m 



J j—8m<t r <i<j+lQ5m 

Taking 5 > 0 sufficiently small and using the fact that j < — |m, the desired gain in m 
follows. 


I 
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Step 5.4: Contribution of Q 4 . Proceeding as in Steps 5.2 and 5.3 for every possible 
contribution of 

and recombining the expressions, it follows that 

Q j ^ £ ^ £/ (t,x,D,y,s)P 0 Q <j - C =Qj^e^iP 2 ' )+) (t,x, D,y, s)P 0 Q <j - C 

+ QjdT ^ > (t,x,D,y,s)P 0 Q <j -c 


=■■ + Q4.2&0 


where 


vp, = vpf 0 ) + *(<«), vp(> 0 ) (t, x> s , j,, f) := n^(t, x, 0 - Vi mo¬ 

using (I9.40p with q = 3 and summing up in 0 > we obtain 


IT 




By the decomposability bound (19 .1 7f) with q — 6 for 4q, it follows that 

lia.i&OlUfLS-.L?, < 2-i’2 e 2^-'>2 ote -^. 

Then integrating over j — 8m < t' < t < j + 105m, we arrive at 

II / Q 4|1 (A 0«lli, r i^i.,<2-^2C { ’” 


«/ j—5m<£ f <-£<j+105m 

which is acceptable for 5 > 0 sufficiently small, since j < — |m. The term Q 4 2 is treated 
similarly, with the roles of and swapped. This completes the proof of (19.41 jh 

Step 6: Low modulation input, j < — \m, contribution of C. In this step, we establish 

\\QjC<o{t, x , D, y, s)PQQ< j - C \\ N ,^ x oj /2 < 2 5im . (9.46) 

This step is easier than Steps 4 and 5, as we do not need to get the angle separation to 
apply the decomposability bound (19.1511 : instead, we can use (19.171) . Thanks to this fact, the 
gauge transform need not be as finely localized in frequency as C and Q. Accordingly, we 
make the following decomposition: 


C =i 



V e V t ,Ve" - eJ^Lc ") dfdi'dl 


£>£'>£" >j—5m 



£>£' >£" >j—5m 
£>j+105m 


V f y t 'V e , dfdi'de 



V e V e V e , ( e<< ^</" - 1 ) dfdi'di 


j—5m<£" <£'<£< j+105m 



VtVt'Ve" d/'dl'dl 


(9.47) 

(9.48) 

(9.49) 

(9.50) 


t j—5m<£" <£'<£< j+105m 

=• C\ + C 2 + C% + C 4 . 


We treat Ci,... ,C 4 separately. 
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Step 6.1: Contribution of C\. Proceeding as in Steps 4.1 and 5.1, it follows that 
(Ci )« 0 = (i [[[ (e^c <e " ~ e ^-c ") dl"dl'dl\ 

where the outer (-)<o may be easily disposed by translation invariance. Moreover, we have 


I®*'®*" (ej""' - 


2 = 


5m 


by (I9.17P and (I9.30p . Integrating over j — 8m < £" < £' < £ < —m, this is acceptable. 

Step 6.2: Contribution of C 2 . As before, by frequency localization properties, the double 
frequency projection (-)<o leaves C 2 , C 3 and C 4 unchanged. Using (I9.17P with q = 6 for every 


factor of ifj and L 2 X boundedness of , it follows that 


l"e^_c" PoQ<j- C \\ L^Ll-^L\ x 


Integrating over {£ > £' > £" > j — 8m} D {£ > j + 105m}, this is good. 

Step 6.3: Contribution of C 3 . Here we use (19 .1 71) with q = 6 for every factor of i/j and 
(I9.2ip . Then we have 


Integrating over {j — Srn < £" <£'<£< j + 105m} and using the fact that j < —|m, we 
obtain the desired gain in m. 

Step 6.4: Contribution of C4. Summing up (I9.40P with q — 6 in 9 > 2 ak , we obtain 

WM*, x , D)a(D) - a{D)^ e (D, y, s ) < 2~l e 2 {1 - Ca)l , (9.51) 

where a(£) is any smooth bump function adapted to {|£| < 1}. Applying the decomposability 
bound (19.17P twice with q — 6 , it follows that 

\\Q j *i*e>*t»PoQ<j-c\\L ? >v^L* <2~&2&- t: )2&- t ')2&- e, W 1 - c °». 


^x TLJ t,X 


(9.52) 


We integrate this over {j — 8m <£"<£'<£< j + 105m}. Since j < —|m, the desired gain 
in m follows provided that a > 0 is sufficiently small. 

Step 7: Low modulation input, j < — |m, low frequency phase. To establish (I9.29p . 
it is only left to prove 

D,y,s) - l]a,g <# -c|| w .^Mfl < 2~‘ ,m ■ 

Since 

Qj\eHT Srn ~ 1 ]PoQ<j-c = 0 

by modulation localization, it suffices to establish 

II QAe^ < ‘-‘"(t,x,D,y,s) - ~ 

Proceeding as in Step 2, this estimate is reduced to 


<j — Sm / 
o J ' 


(+ T n 7/ — P P^W n 9 < 

{t,x,u,y,s) e <g:j-c \-ro\\LS°Ll->L ? z z • 


The last estimate follows from (19.30p . 


□ 
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9.5. Parametrix error estimate. Here we prove (19.121) . The argument here is essentially 
the same as in [T3] . 

Step 1: Decomposition of the parametrix error. At the level of left-quantized opera¬ 
tors, we compute 

^^(^1,1)) - e^’ ± {t,x,D)D 

= 2(^e-^) <0 ^ + (□e"^ ± ) < o + 2iA e < _ m e^ ± d i + 2tA e < _ m (d £ e~ i ^) <0 

= 2(uj • d x 'i/j±e~ i ^ ± ) <0 \D\ + 2{u • A<_ m e<? ± )|D| - 2{d^ ± e~ i ^) <Q D t 

- (d^ ± d^ ± e-^) <0 + 2A e < _ m (d^ ± e- i ^) <0 , 

where we are using the shorthand oj = £/|£|. This computation can be justified simply by 
using the direct definition of left-quantization, or by using the symbol calculus as in [IB]. 
On the last line, we used the fact that O0±(£, x, £) = 0 as DA = 0. 

To see the cancellation between A<_ m and we add and subtract 2(±c^± — uj ■ 

A<_ m e *^ ± )<o| D |. Then we can write 

□^< 0 ^ - e <o / ' ± n = - 2((±<9^± -u-d x ^ ± -u- A < _ ro )e-^ ± ) |D| 

V / <0 

-2(^ ± e- ife )<o(Al|5|) 

- (-{dt^±d t ip± + d x il>± ■ d x ^ ± )e~ H,± ) <0 
+ 2A<_ m • 

+ 2[(u-A<_ m ,,S < o]e-^ ± | J D| 

=: Diffi + Diff 2 + Diff 3 + Diff 4 + Diff 5 . 

Step 2: Estimate for Diff 4 . Being highest order, this is a-priori the most dangerous term. 
This is precisely the point where we need a > 0. In this step we prove 

UDiffiPolk-^JV < 2~^ m + 2~ m . (9.53) 

Step 2.1: Preliminary reduction. By (19.4p . it follows that 

Diffi = -2( 22 n < 2 ^(e- A k )e-^ (t,x,D). 

k<—m 

Note that e _l At can be replaced by by the frequency localization of A. The outer 

(•)<o can be easily disposed by translation invariance. Therefore, it suffices to consider 

Si := 2 22 ( n < 2 ^(£ • A k )e^ ± ){t,x,D)P 0 

k<—m 

Step 2.2: Reduction to bilinear estimate. Our next order of business is to remove 
For this purpose, consider the operator 

S 2 2 2^ (n ^ 2ak (^ ■ A k ))(t,x,D)e^ ± (t,x,D)P 0 

k<—m 

\\Si — < 2 
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We claim that 


(9.54) 






This estimate contributes the term 2 m in (19.531) . as N* C LfL\ and L]L 2 C N. 

Thanks to frequency localization of e < ^f ± , we can harmlessly insert an operator a(D ) 
between the two pseudodifferential operators in S 2 , where a(£) is a smooth bump function 
adapted to {|£| ~ 1}. Then by Lemma 19. 5[ it follows that 


114 — ^2\\l^LI^L}LI 


< Y. l|3s(n^..(OAOaK))llDL>i.||(-^ ± e-* 

k<—m 


<C IlLS 


DL x^ L t, X 


Note that 

4( n <2-4 • A k )a(0) = (||a(0 + |el^a(0)n^(o; • A k ) + |£|a(£)4(I^ 2 <4^ • A k )) 

The factors involving only £ can easily be removed as they are bounded. Invoking (19.161) 
and summing over 6 < 2 ak and k < —m, it follows that 

Y H 9 «( n <2»‘({ ' < Y 2ik2i °" ~ 2 - 4<1+ ’ 1 ’". (9.55) 

k<—m k<—m 


On the other hand, summing (19.15ft over 6 > 2 ak and k < —m, we obtain 

< 2 -’ (1 -"> m (9.56) 

Now replacing eby e^cf, removing the outer (-)<c by translation invariance as usual 
and using (19.561) . we obtain 


11(4 


3 -#±' 


<c\\l\ 






Combining (19.55j) and (19.561) . estimate (19,54j) follows. 


Step 2.3: Bilinear estimate. It is now only left to treat S 2 . Note that the operator 
e ~^(t, x ,D)P 0 can be easily removed at this point, as it is bounded on Nq. Therefore, it 
suffices to show 

II U^ k A k (t,x,D) ■ d x P 0 \\ N ^ N <2~^ m 

k<—m 

where Pq is a slightly enlarged version of Pq. 

Recall that n< 2CTit localizes A k into angular sectors of size ~ 2 ak centered at u — t|j (close- 
angle) and — id (far-angle). Therefore, by a Whitney-type decomposition in angles, it suffices 
to consider the sum 


Y E E ( p ‘ p C) - d * p « p ? + E E (p^a) ■ 

k<—ml<ak <f,$ k<—m 

dist((/>,</>')~ 2 f dist(0,0')~l 


where the first sum corresponds to the close-angle interaction, and the second sum corre¬ 
sponds to the far-angle interaction. 

We begin by treating the close-angle interaction. We split this sum into two cases, de¬ 
pending on whether the input modulation is > k + 2£ — C or otherwise. 

Step 2.3.1: Close-angle, high modulation input. By the sharp LfL ® Strichartz esti¬ 
mate and Bernstein, we have 

link^IlL^ < 22 l 2K 
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(9.57) 















We estimate the output in L}L 2 X and the input in , using (I9.57P for P k Pf A. Note that, 
thanks to the null structure in A ■ d x , we also gain a factor of 2 l . Using £ 2 summability in 
angles for A and the input, we obtain 

X] || (P k PfA) ■ d x P 0 PfQ>k + 2e-c\\ x oj/^ LltL2x < 2 K 

Summing over £ < ak and then k < —m, the desired gain of 2~^ am follows. 

Step 2.3.2: Close-angle, low modulation input. In this case, by elementary geometry 
of the cone, the output modulation is ~ 2 k+2£ . Placing the output in 1 and the input 
in L^°L 2 , the numerology is the same as in Step 2.3.1 and we obtain a gain of 2~^ am . 

Step 2.3.3: Far-angle. We proceed as in the case of close-angle interaction, this time 
splitting the input into Q >k -c + Q<k~c- In this case we do not gain from the null structure, 
but obtain the desired gain 2^ ak from (19.571) . 

Step 3: Estimate for Diff 2 . Here we need to use the S± norm. We claim that 


l|Diff2|ls»^v<2- m 

This estimate follows from the obvious mapping property 

D t t \D\ :Si^N. 


and estimate (j9.9[) . 


Step 4: Estimate for Diff 3 and Diff 4 . Again, we replace e ^ by e < ^f ± , and dispose the 
outer (-)<o by translation invariance. Summing up (I9.15P in 9 > 2 ak , we have 

On the other hand, since A k = P k A is independent of £, it follows from Strichartz that 

\\Ak\\DLlL™ ^ ll^fc||l, t 2 L£“ ^ 2 2 11 [0] ll^fixL 2 

Then by decomposability and L 2 X boundedness of it follows that 


|| Diff 3 + Diff 


i\\L™Ll^L\Ll ru 


<C 


which is enough. 

Step 5: Estimate for Diff 5 . For each component, the commutator may be written as 


[A < _ m ,^ <o ](0)=L(VA < _ m ,(/») 


where L is a translation invariant bilinear operator with an integrable kernel. Using this 
expression, we now proceed as in Step 1. Summation in k < —m is now possible thanks to 
the extra derivative V, and we obtain 


||Diff 5 ||iv^iv<2- m . 

Combining Steps 1-5, estimate (I9.12p follows. 
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